arXiv: 1508.01 llOvl [cs.CC] 5 Aug 2015 


Symmetries of matrix multiplication algorithms. I. 


Vladimir P. Burichenko 


Institute of mathematics of National Academy of Sciences of Belarus 
Kirov Street 32a, Gomel 246000, Republic of Belarus 
vpburich@gmail.com 

Abstract 

In this work the algorithms of fast multiplication of matrices are considered. To any 
algorithm there associated a certain group of automorphisms. These automorphism 
groups are found for some well-known algorithms, including algorithms of Hopcroft, 
Laderman, and Pan. The automorphism group is isomorphic to S 3 x Z 2 and S 4 for 
Hopcroft anf Laderman algorithms, respectively. The studying of symmetry of algo¬ 
rithms may be a fruitful idea for finding fast algorithms, by an analogy with well-known 
optimization problems for codes, lattices, and graphs. 
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1 Introduction 

In the present work we study algorithms of fast multiplication of matrices. This work is a 
continuation of the previous work of the author [13] (but it can be read idependently of [13]. 
It is even preferable to read the present work before [13], because some basic concepts are 
exposed here better than in [13]). 

In 1969 V. Strassen [42] found an algorithm for multiplication of two N x N matrices, 
requiring 0(N T ) (or, more exactly, < 4.71V 1 ") arithmetical operations; here r = log 2 7 = 
2.808.... (Recall that the usual algorithm (“multiplying a row by a column”) requires 2 N 3 — 
N 2 operations). This algorithm is based on the fact, discovered by Strassen, that two 2x2 
matrices with non-commuting elements, i.e., matrices over an arbitrary associative ring R , 
can be multiplied using only 7 multiplications in R. 

Later some algorithms, asymptotically faster than Strassen’s, were found. We give a very 
short survey of the related works in the end of this section. 

The subject of the present work is studying symmetry of algorithms. The author thinks 
that using symmetry may be a fruitful way to find good algorithms. 

Very short and clear exposition of the Strassen algorithm (appropriate for a student) may 
be found in some textbooks on linear algebra or computer algorithms. See, for example, [32], 
§1.4, Ex. 12, or [1], §6.2. An introduction to the whole area of fast matrix multiplication may 
be found in book [12] or survey [34], The books [22], [9], and Section 4.6.4 of [29] also should 
be mentioned. Nevertheless, the author tried to make the present work self-contained, so in 
this Introduction and the next section all necessary concepts, related to matrix multiplication 
algorithms, are recalled. Also, in Subsection 1.3 are contained some directions concerning 
literature in algebra. 
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1.1 Definition of an algorithm 

An algorithm for the multiplication of matrices of given size may be described either in 
computational (i.e., as a sequence of computations), matrix, or tensor form. 

To give an example of an algorithm in computational form, we recall the description of 
the Strassen algorithm. Let R be arbitrary (associative) ring, and let 

X — ( Xu Y = (y 11 

\x 2 i X22 J ’ y/21 2/22/ 

be matrices over R. Consider the following products: 

Pi = X U (y l2 + 2/22), P2 = (xil - X 12 )U22, P 3 = (-X21 + X 2 2)yiu 

Pa = ^22(2/11 + 2/21), P 5 = {xu + X22) (2/11 + 2/22), 

P 6 = {x\l + X 2 l)(yil - 2/12), P 7 = {xi2 + X 22 ) (2/21 - 2/22)- 
Next, take linear combinations 

2 ll = -P 2 - Pa + P 5 + P 7 , Z 12 =P 1 -P 2 : 

Z21 = -P 3 + Pa, Z22 = -Pi -P3+P5- Pe¬ 
lt is easy to check that these z l3 are nothing else but the elements of the matrix Z = XY. 
Thus, we have computed the product of X and Y, using only 7 multiplications (but 18 
additions/subtractions) in R. 

Further, describe what is the matrix form of an algorithm. Let m, n, p, and r be natural 
numbers, K be a field. Take symbols x t] and y 3 k, where 1 < % < m, 1 < j < n, 1 < k < p 
(■mn + np symbols total), and let 


A = K(x i 3 ,y jk | i,j, k) 

be the free associative algebra over K generated by these symbols. Next, suppose we are 
given field elements a l 3 i, bj k i, c ik i G K, for 1 < i < m, 1 < j < n, 1 < k < p, 1 < l < r; we 
may think of them as elements of 3r matrices 


Q'i ( ^/jI ) 1 <t<u). l<j<n, {bjkl)l<j<n, l<k<p, 

Cl {Cikl)l<i<m, l<k<p, l 1, . . . , T. 

Suppose that the following mp relations in A are true: 

r n 

'^2 C ikl{ Q'uvl'X'uv)( < ^ ^ ^ vwlVvw ) E %ijVjk ? (1) 

/=1 1 <u<m l<v<n j =1 

l<v<n l<w<p 

for all 1 < i < m, 1 < k < p. Then we say that the set of r triples of matrices 


A = {( a h bi,c t ) | l = 1, • • • ,r} 
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is a bilinear (or noncommutative ) algorithm over K for multiplication of an m x n matrix 
by an n x p matrix, requiring r multiplications (also called an algorithm of length r or of 
bilinear complexity r). 

Indeed, let R be an arbitrary (associative) algebra over K, and let Q = {q l3 ) and S = (sjk) 
be m x n and n x p matrices, respectively, over R. Then one can compute their product 
T = QS in the following way. First compute all linear combinations 

di ^ ^ a uv iq uv , fi ^ ^ b vw is vw , 

l<w<m l<v<n 

1 <v<n l<w<p 

for all 1 < l < r; then compute all products pi = difi, and finally compute all linear 
combinations 

r 

tik — ^ ^ c iklPl * 

1=1 

for all 1 < i < m, 1 < k < p. Then it follows from the relations (1) that t' ik = tik are 
precisely the elements of T. Thus, we have computed T, using r “nontrivial” (also called 
non-scalar) multiplications in R (here a scalar multiplication means a multiplication by an 
element of K). (It is well known that for algorithms of matrix multiplication the number of 
multiplications is most important. In particular, if there exists a non-commutative algorithm 
A for multiplication of an m x n matrix by an n x p matrix, requiring r multiplications, 
then there is an algorithm for multiplication of two N x N matrices of complexity 0(N T ), 
where r = 3 log mnp r. On the other hand, the number of additions/subtractions and scalar 
(i.e., by an elements of K) multiplications in A affects only the constant factor in 0(N T ). 
The details may be found in the literature). 

Example. It is easy to see that the Strassen algorithm may be written in matrix form 
as the following set of seven triples of matrices: 



We will denote the Strassen algorithm by S. 

A description of what is an algorithm in tensor form will be given in Section 2. 

1.2 Motivation, the aim of the work, and the results 

By r^(m, n,p) we denote the minimal number of multiplications in a bilinear algorithm over 
a held K for multiplication of an m x n matrix by an n x p matrix. In principle, r^(^, n,p) 
may depend on K, but the author does not know any particular example of m, n, p, K x 
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and K 2 such that rK 1 (rn,n,p) ^ r^ 2 (m, ) (but, in the author’s opinion, such examples 

certainly must exist). 

It is widely recognized that finding of rK(m,n,p) for small m, n, p is an important 
problem, both from theoretical and practical viewpoint. The greatest interest at the moment 
is attracted by r(3, 3, 3). 

For small m, n, p the following estimates are known (for any K ). 

• r(2, 2, 2) = 7. The inequality r(2, 2, 2) < 7 follows from the existence of the Strassen 
algorithm. The opposite inequality r(2,2,2) > 7 was first proved in [46], and later 
several other proofs were found. 

• r(2,2, 3) = 11, r(2,2,4) = 14, 17 < r(2,2,5) < 18. Here the upper estimates easily 
follow from the Strassen algorithm, and the lower ones were proved by V.B.Alekseev 
in works [2], [3], [4], respectively. 

• 14 < r(2, 3, 3) < 15, 19 < r(3, 3, 3) < 23. Here the upper estimates follow from the 
algorithms contained in the works [23] and [33] respectively (we recall these algorithms 
in Sections 6 and 5). The lower estimates were proved by Blaser in works [7] and [ 8 ], 
respectively. 

It is also well known that r(m,n,p ) is symmetric in m, n, and p, and that r(m,n, 1) = mn. 

It should be noted that in the case when K = GF( 2) = {0,1}, or if the coefficients of 
algorithms are supposed to be integers, there are some further results (see [23] and [24]). 

It may be a good idea in the search for economical algorithms that such algorithms may 
have many symmetries, that is, a large automorphism group. (It will be explained later in 
the article what we mean by an automorphism of an algorithm). Note that one faces the 
similar situation when studying codes, lattices, or graphs. Good (that is, dense) lattices 
and codes often have large group of automorphisms (see [15] for numerous examples of this 
phenomenon). Similarly, the graphs satisfying certain regularity conditions (distance regular 
graphs, especially those with “extremal” set of parameters) often have large automorphism 
group; see [ 11 ]. 

In [13] the author has proved that the Strassen algorithm S has the automorphism group 
Aut(iS) = S 3 x S 3 , or even S 3 x D 6 , if we consider automorphisms in some “extended” sense. 
Here S 3 is the symmetric group on 3 letters, and Dq is the dihedral group of order 12 (i.e., 
the group of all symmetries of a regular hexagon). 

It should be noted that S is, in a sense, unique: any other algorithm for multiplication of 
two 2 x 2 matrices requiring 7 multiplications is conjugate to S under certain transformation 
group. See [20], [ 21 ], 

Before trying to find good algorithms with large automorphism groups in unknown cases 
(say, for multiplication of 3 x 3 matrices), it is a reasonable first step to study automorphisms 
of some good algorithms known so far. This is the aim of the present work. 

J.E.Hopcroft found an algorithm for multiplying of 3 x 2 matrix by a 2 x 3 matrix, 
requiring 15 multiplications. This algorithm is described in [23], and more accurately in [24], 
We denote the Hopcroft algorithm by FL. 

J.Laderman [33] found an algorithm for multiplication of two 3x3 matrices requiring 23 
multiplications. We denote this algorithm by C. 

V.Ya.Pan (see, for example, [39]) described several algorithms for multiplication of matri¬ 
ces of arbitrary size, known as the trilinear aggregation algorithms. The most known of them 
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is an algorithm for multiplication of two n x n matrices, where n = 2 m is even, requiring 
(n 3 — 4n)/3 + 6 n 2 multiplications. We denote this algorithm by 1 D i m . 

One of the main results of the present work is the following theorem. 

Theorem 1.1 Let TL. C, and, V 2m be the algorithms of Hopcroft, Laderman and Pan, men¬ 
tioned above. Then 

Aut (H) = S 3 xZ 2 , 

Aut (£) = S A , 

and 

Aut (fP 2 m) — S rn X Z 2 x S 3 . 

(Of course, we will give a description of automorphism groups, mentioned in this theorem, 
not only up to isomorphism, but in an explicit form). 

1.3 Some further remarks 

Remark 1. Since Strassen’s work, other estimates for asymptotic complexity of matrix 
multiplication (better than 0(N 2 ' 81 )) were found. The authors who contributed to these 
investigations are (approximately in chronological order) Pan, Bini/Capovani/Lotti/Romani, 
Schonhage, Strassen, and Coppersmith/Winograd. The most significant progress was made 
due to the so-called “laser method” of Strassen. The details and references may be found 
in the literature, see for example Chapter 15 of [12] and the introduction to [45]. The 
most recent works belong to Stothers [41], Vassilevska-Williams [44] (see also [45]), and 
Zhdanovich [49]. In [44] and [49], independently, the estimation 0(N UJ ) with to < 2.373 was 
proved. 

It should be said that in all these estimations the constant factor in 0(N U ) is very large, so 
that the corresponding algorithms are only of theoretical interest and are useless in practice. 
For practical purposes only the following algorithms may be used: the usual algorithm, the 
Strassen algorithm, the Pan trilinear aggregation method, and the “compound” algorithms 
(which will be mentioned in the next remark). 

Remark 2. There are other types of algorithms for matrix multiplication, different from 
bilinear algorithms as described above. Namely, there are 

• commutative (or quadratic ) algorithms, which may be used if we suppose that elements 
of matrices belong to a commutative ring; see [47] or [36] for the examples of such 
algorithms, and §14.1 of [12] for general definition; 

• approximate algorithms, like in [ 6 ] (see [12], §15.2 for further explanations); 

• “compound” bilinear algorithms, that is, the algorithms assembled, in an appropriate 
way, from several algorithms of smaller formats. The work [18] contains a survey of 
such algorithms. 

The algorithms of all these three types are not considered in the present work. 

Remark 3. The present text is written in a manner a bit different from the usual journal 
article. The author means that he gives more details than it is usual in a journal. So the 
reader may find some places trivial. The reason is that the author wishes that the text could 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


6 


be readable both by specialists in algebra and by computer scientists; but these specialists 
may have modest background in computer science or algebra, respectively. 

The author would like to give some references / reading suggestions for readers who may 
be not very experienced in algebra (say, computer scientists). 

The reader can use textbooks [31], [5], and [32] as a basic course in general and linear 
algebra (including the basics of the group representation theory). The book [48] contains a 
very lucid exposition of multilinear algebra (i.e., the theory of tensors). The last chapter of 
[32] is also devoted to multilinear algebra. The book [16] is a classical (but not elementary) 
source for group representation theory); chapters 1 and 2 are especially recommended to 
the reader. There is also an elementary and application-oriented textbook [26]. Finally, we 
should list some graduate-level algebra courses, namely [35], [25], and [19]. 

Structure of the work. The work is organized as follows. In Section 2 we recall the 
relations between matrix multiplication algorithms and decompositions of tensors. Section 3 
contains general considerations on symmetry of tensors and algorithms. In a long Section 4 
we find the isotropy group of the structure tensor of matrix multiplication map (which 
is a necessary preliminary step for studying automorphisms of any particular algorithm). 
In Sections 5 and 6 we find automorphism groups of Laderman and Hopcroft algorithms, 
respectively. 

In Part II of the work Pan’s trilinear aggregation algorithm, and some other topics, will 
be considered. 

Acknowledgement. The author thanks A.S.Kleshchev for useful literature directions. 


2 Tensor form of an algorithm 

Let Vi,..., Vi be vector spaces over a held K , V — V\ ® ® V) be their tensor product. 

A tensor t e V is decomposable if t — U]jg) ... <g) Vi, for some Vi e V), i — 1,..., l. We will 
consider representations of a tensor t G V in the form t = ti + ... + t s , where ti,,t s are 
decomposable tensors. The least possible length s of such a representation is called the rank 
of t, and is denoted by rk(f). Obviously, rk(f) = 1 if and only if t is decomposable. 

In the situation described the following terminology is also used: the set {t \,... ,t s } is 
called an algorithm (of length s), computing t. 

By M mn (K ) (or just M mn ) we denote the space of all m x n matrices over K. The basis 
of M mn (K) is (e t j | 1 < i < m, 1 < j < n), where e t] are usual matrix units. 

In the sequel an important role is played by the tensor 

(m, n, p) = £ Cij Of) e.ji; tb f’A'i EE M mn Eb Mnp ® Mpm ■ 

1 <i<m, 1 <k<p 

(Here two remarks concerning notation are in order: (1) Note that we have abused notation 
a bit, by using the similar symbols and for elements of different spaces; (2) in [13] the 
tensor (■ m,n,p ) was denoted by S(m,n,p). The notation (■ m,n,p ), which is now classical, is 
due to Schonhage.) 

It is a classical fact (first established by Strassen) that there is a Injection between the 
set of all algorithms for multiplication of an m x n matrix by an n x p matrix requiring r 
multiplications, and the set of all algorithms of length r computing (■ m,n,p ). The following 
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proposition gives an explicit description of this bijection. Note that in condition (a) of this 
proposition we think of matrices as tables whose elements are elements of K, so that 

® 1 <j<n i 

whereas in condition (b) we think of matrices as elements of linear spaces, so that 



As usually, by x* and 5 a b we denote the transposed matrix and the Kronecker delta 
symbol. 


Proposition 2.1 Let m,n,p,r G N, and let K be a field. Let 

®Z iflijl) l<i<m, l<j<n > hl iPjkl)l<j<n, l<k<p j ai~ld Cl \pikl)\<i<m, 1 <k<p j 

where l = 1,_, r, be matrices over K, of sizes m x n, n x p, and m x p, respectively. 

Then the following three conditions are equivalent: 

(a) {(a/, bi, ci) | / = 1,..., r} is a bilinear algorithm over K for multiplication of an m x n 
matrix by an n x p matrix; 

(b) {ai ®bi® (ciY | l — 1,..., r} is an algorithm computing tensor (m, n,p); 

(c) the following (rrinp) 2 equations, for all 1 < i, i\ < rn, 1 < j, j\ < n, and 1 < k, k\ < p, 
are satisfied: 

r 

^ ^ kd- i] /.'11 ^iii^jji^kki ■ (2) 

Z=1 


Proof. First we prove that conditions (a) and (c) are equivalent. Consider relation (1) of the 
Introduction, 

r n 


^uvl^uv)^ ^ ^ bvwlVvw') E %ij Uj k) 


1 =1 


1 <u<m 
l<v<n 


1 <v<n 
l<w<p 


3 = 1 


for a given pair (i, k). Clearly, this relation is true if and only if the coefficients at x e fy g h 
on both sides coincide, for every quadruple (e, /, g, h ) such that 1 < e < m, 1 < /, g < n, 
1 < h < p. It is easy to see that the coefficient on the left is Y^i =i c iki a efY>ghh and the 
coefficient on the right is 5i e 5f g 5kh■ Thus we obtain the condition 


^2 C ikl a eflb gh l SieS fg 5 
1=1 


kh 5 


for all i , k , e, /, g, h such that 1 < i, e < m, 1 < /, g < n, 1 < h, k < p. But the latter 
condition coincides with the equality in condition (c), up to names of indices (namely, e, /, 
g , h, i, k should be changed to i, j, j\, k , i\ , ki, respectively). 

In a similar way one can prove that conditions (b) and (c) are equivalent too. Indeed, 
condition (b) means that 

r 

^2ai®bi® (cif = (m,n,p). 

i=i 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


Now it is sufficient to observe that the tensors of the form eij®ej 1 k<&ek 1 i 1 , where 1 < i, i\ < m, 
1 < j,j 1 < n, and 1 < k, k\ < p, form the basis of M mn 0 M np 0 M pm , and to calculate the 
coefficients at basis elements in both sides of the latter relation. 

Finally, the conditions (a) and (b) are equivalent, because each of them is equivalent 
to (c). □ 

Let A = {(ai,bi,ci) | l = 1 ,, r} be an algorithm for multiplication of an m x n matrix 
by an n x p matrix, and let A! = {cq 0 bi 0 cj | l — 1 ^.., r} be the corresponding algorithm 
computing (■ m,n,p ). Then we say that A' is the tensor form of A. 

Example. In the Introduction we have recalled Strassen algorithm and have written it 
in matrix form. It is readily seen that in the tensor form this algorithm is 

<S = { e\\ 0 (ei2 + 622) 0 (e2i — 622), (eii — ^12) 0 622 0 (-en — e2i), 

( — 621 + ^22) ® en 0 (—ei2 — 622), 622 0 (en + e2i) 0 (-eu + e^), 

(en + 622 ) 0 (en + 622 ) <£> (en + 622 ), (en + e 2 i) 0 (en — ei 2 ) < 8 > (—£ 22 ), 

(ei2 + e 22) ® (e2i — £ 22 ) en }. 

Remark. The reader can check directly that the sum of the tensors of the latter set is 
(2,2,2). Such a checking can be considered as an evidence that we had made no mistakes 
when finding the matrix form of the Strassen algorithm from its computational form, and 
then the tensor form from the matrix form. 

The equations (2) first appeared in [10], so they are called Brent equations (but it is pos¬ 
sible that similar equations appeared earlier in studying decompositions of general tensors, 
see [30]). 

One of approaches to finding algorithms for matrix multiplication is to solve Brent equa¬ 
tions, usually by computer calculations. To do this, one usually reduces solving the Brent 
equations to finding minima of certain real-valued function of many (several hundreds) vari¬ 
ables, and then solves this optimization problem by numerical methods. See [10], [27] for 
more details. Other works in this direction are [37] and [40]. In works [14] and [33] the 
Brent equations are also used, but in a different way (without numerical optimization). 


3 Group actions on tensors and algorithms 

Let Ui ,..., U m and Vi ,..., V n be spaces over a held K, and let U — U\ ® ... <8> U m and 
V — Vi 0 ... <E> V n he their tensor products. By a decomposable isomorphism we mean an 
isomorphism of vector spaces tp : U V such that there are a bijection r : {1,..., m) —)■ 
{1 ,... ,n} (whence m = n) and isomorphisms (pi : Ui —» (whence dim V T ^ = dim U % for 

all i — 1 ,... ,m) such that 


<p(ui 0 ■ ■ ■ 0 u m ) = yy-l(i)(w r -l(l)) 0 . . .0 <£r-l(m)(«r-l ( m)) 
for all Ui G Ui, i — 1,..., m. 

For example, the usual permutation of factors n : X 0 Y —> Y 0 X, x 0 y (->• y 0 x, is 
a decomposable isomorphism. 
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It is clear that the composition of two decomposable isomorphisms is a decomposable 
isomorphism also, and the isomorphism inverse to a decomposable isomorphism is decom¬ 
posable too. It follows that a decomposable isomorphism p : U —> V maps the set of all 
decomposable tensors in U bijectively onto the set of all decomposable tensors in V. 

If 93 is a decomposable isomorphism defined by data (r; p \,..., p m ), as in the first para¬ 
graph of this subsection, then we say that r is a permutation, corresponding to p (and p is 
an isomorphism, corresponding to r). For pi the similar terminology is used. 

It is often convenient to think of the permutation r, corresponding to p, as a (bijective) 
map from set of factors {U\ ..., U m } of U to the similar set {V\V m } of V. In particular, 
if U = V, then we may think of r as a permutation of the set {U\ ..., U m }. 

It is clear that if p and if are decomposable isomorphisms, and cr and r are permutations 
corresponding to p and if, respectively, then p = err is a permutation corresponding to the 
isomorphism 9 = pip. Moreover, a -1 is a permutation corresponding to p~ x . (Note that we 
multiply permutations “from right to left”, for example, (134)(2) • (14)(23) = (1)(243)). 

In general, the permutation (as well as the isomorphisms pf), corresponding to a given 
decomposable isomorphism p, is defined by p not uniquely. Consider two examples. 

Example 1. Let m — 2, U\ — (ei), U 2 = (e-p), V\ = (/ 1 ), and V 2 = (f' 2 ) be one- 
dimensional spaces. Their tensor products U — U\ ® U 2 — (ei ® ef) and V — V± ® V 2 — 
(fi < 8 ) fP) are one-dimensional also. Let p : U —» V be the isomorphism taking e\ ® e 2 
to Then p is the decomposable isomorphism corresponding to the permutation 

r = e = {1 H> 1, 2^2} and isomorphisms p\ : ei i-> / x , p 2 : e 2 H > p 2 - On the other 
hand, p may be considered as the decomposable isomorphism, corresponding to permutation 
t / = (1,2) = {1i— >2, 2 1 —>■ 1} and isomorphisms p\ : ei (->• / 2 , p ' 2 : 62 1 —^ fi ■ 

Example 2. Let p\ : U\ —> V\ and P 2 : U 2 —> V 2 be isomorphisms of spaces. Then 
the decomposable isomorphism 

p = p\ ® P 2 '■ U = Ui®U 2 —* V — Vi 0 V 2 

can also be written as p = p\ ® p' 2 , where p[ = Xpi and p 2 = \~ l p 2 , for any A G K*. 

It turns out that the possible ambiguity of data (r; pi ,..., p m ), corresponding to a given 
decomposable isomorphism p , may be only of these two kinds. To prove this, we need two 
simple (and well-known) statements. 

Lemma 3.1 1) Suppose that p e GL(V) is an automorphism of a space V such that 
p(( v )) = ( v) for each one-dimensional subspace (v) C V. Then p is a multiplication by 
a scalar A G K*. 

2) Let p,p' : U —» V be isomorphisms such that p({u)) = p'((u)) for each one¬ 
dimensional subspace (u) C V. Then there exists A G K* such that p' = Xp. 

Proof. 1) Let v \,..., v n be a basis of V. As p preserves all (vf), we have p(vf) = A iVi, for 
some elements A* G K*. Next, take any i 7 ^ j. The vector pipy + vf) = \vi + A jVj must 
be proportional to Vi + Vj, whence A* = A j. Consequently, all Ai = A 2 = ... = A n coincide, 
whence p is the multiplication by a scalar A = Ai. 

2) It is obvious that 9 = p~ l p’ is an automorphism of U, and for any u G U we have 
9({u)) = (p~ l p')({u)) = p^ 1 (p , ((u))) = p~ 1 (p((u))) = (u). Now we have 9 — A (=A • id[/) 
by 1), whence p' — Xp. □ 
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Lemma 3.2 Let U — U\ ® ® U m be the tensor product of several spaces, and for each 

i = 1,... pm let Ti,T' C Ui be two nonzero subspaces. Then the subspaces 

f = T 1 ®...®T m and T' = T[ ® ... ® T' m C U 

coincide if and only if T, = T[ for alii. In particular, if 0 ^ t = u \®... <g> u m — u[ ®® u' m 
are two decompositions of a nonzero decomposable tensor, then (uf) = ( v! i ) for all i = 
1 

(The proof is left to the reader.) 

Proposition 3.3 Let U\ ..., U m and Vi,... ,V m be two sets of m spaces each, and let U = 
U\ <g)... <S> U m and V = V\ ®... ® V m be their tensor products. Let r, t' G S m be permutations 
such that dim Wp) = dim K-'p) = dim Ui, let <pi : U —* Kp) and ip\ : Ui —* W'p) be 
isomorphisms, and let ip, ip' : U —* V be the decomposable isomorphisms correspondmg to 
the data (r; p >\,..., < p m ) and (r'; ,..., p >’ m ), respectively. Suppose that ip = ip'. Then the 
following two statements hold. 

1 ) t' — ra, where a is a permutation of {1,, m} such that a(i) = i for all i such that 
dim Ui > 1. In particular, r' = t, if at most one of the spaces Ui is one-dimensional. 

2) Suppose that r = t'. Then there exist elements Ai,..., X m G K* such that ip[ = \pi. 
For these A i we have X\... \ m = 1. 

Proof. 1) Define cr = r _1 r / . Then t' = rcr. It follows from the previous discussion that a is a 
permutation corresponding to the decomposable automorphism 9 = ip~ l p>' of U. But, clearly, 
9=1 = idjy So it is sufficient to prove the following: if the decomposable automorphism 9 
of U, defined by the data (cr; ifi ,..., fi m ) (where f>i : Ui —» U (J (p are some isomorphisms) 
coincides with idp, then a(i) = i for all i such that dim U t > 1. 

We may assume without loss of generality that i = 1. For each j = 2,... pm take a 
one-dimensional subspace lj C Uj and consider the subspace 

W = U 1 ®l 2 ®...®l m CU. 

Then 9(W) = X 1 ® ... <g> X m , where = ipi(Ui) = U a ( p, and X r = '0a- 1 (r)(^- 1 (r)) for 
r cr(l). In particular, dim X a ^j > 1 and dim Xj = 1 if j ^ cr(l). 

Since 9{W) = W, it follows from Lemma 3.2 that X\ = U\ and Xj = lj for j > 2. In 
particular, dim Ad > 1 and dim Xj = 1 for j > 2. Consequently, cr(l) = 1. 

2) Consider 9 = = idp again. It is rather clear that 9 is the decomposable 

automorphism of U, corresponding to (e;^i,..., ^ m ), where e is the identity permutation 
of {1, • • • pm} and ipi = (pf 1 ^. That is, 9 = ipi ® ... ® f> m . So it suffices to show that if 
ipi G GL(Ui ) are some automorphisms such that 9 = ifi ® ... ® ip m = idp, then there exist 
Ai,..., A m e K* such that ifi = Apd^, and that these A* satisfy the relation Ai... \ m = 1. 

Take arbitrary nonzero elements tq G Ui, Ui 0. Then w = U\ ® ... ® u m 0, and 
w = 9(w) = . .®^ m («m). It follows from Lemma 3.2 that (■ ipi(ui )) = (uf) for all u*. 

Since u t were taken arbitrarily, it follows that ipi(x) is proportional to x for all x G Ui. By 
Lemma 3.1, there exists A* G K* such that ipi = Apd[/ r Finally, w = ^i(wi)<£>. • ■®'f>m( u m) — 
Ai«i ® ... <E) A m Um — Ai... A m w, whence Ai... A m = 1. □ 
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In particular, we see that if at most one of the spaces Ui is one-dimensional, then for 
any decomposable automorphism ip of the space U — U\ ® ® U m the corresponding 

permutation of {U\,, U m } is determined uniquely. 

By S(Ui,..,,U m ) we denote the group of all decomposable automorphisms of U = 
Ui <g)... <g> U m . Next, by S°(Ui ,..., U m ) we denote the subgroup of S(U \,..., U m ) consisting 
of all automorphisms that preserve each factor Ui (that is, corresponding to the trivial permu¬ 
tation of {U \,..., U m }). In other words, S°(U \,..., U m ) is the image of the homomorphism 
GL(Ux) x ... x GL{U m ) -)■ GL(U) defined by 

Clearly, S Q (U\ .., U m ) is normal in S{U\ U m ). The corresponding quotient group 
T = S(U \,..., U m ) / S°(Ui ,..., U m ) may be described as follows. Let hi = {Ui | dim U > 1} 
be the set of all factors U t of dimension > 1, and let 

T' — {(] e Syrn(O) | dimg(AT) = dimX V X e hi} 

be the group of all permutations of these factors, preserving dimensions. Then it is easy 
to deduce from Proposition 3.3 that T can be identified with T' (the details are left to the 
reader). 

Let t G U be an arbitrary tensor. We call the set of all decomposable automorphisms of 
U that preserve t the (full) isotropy group of t, and denote it by T(f): 

r{t) = {geS{U 1 ,...,U m )\g{t)=t}. 

We also consider the small isotropy group 

r°(t) = r(t) n5°(f/ 1 ,...,c m ). 

Clearly, T°(f) < T(f) and T(f)/r°(f) may be identified with a subgroup of the above- 
mentioned T (that is, with a certain group of permutations of factors of dimension > 1, 
preserving dimensions). 

Finally, let A — {fi,..., t r } be an algorithm computing t. Then 

Aut(*4) = {geS(U u ..., U m ) | g(A) = A} 

will be called the automorphism group of A. Obviously, Aut(*A) preserves t\ + ... + t r = t, 
whence 

Aut(*4) < T(f). 

If u G U, v G V, and tp : U —> V is a decomposable isomorphism such that p>{u) = v, 
then rk(u) = rk(u). Moreover, if A is an algorithm of length /, computing u, then B = <p(A) 
is an algorithm of length /, computing v (and conversely, if B is an algorithm of length l for 
v , then A = (p^ 1 (B) is an algorithm of length / for A). Therefore, ip bijectively maps the set 
of all optimal algorithms computing u to the set of all optimal algorithms computing v. 

In particular, we see that T(f) acts on the set of all optimal algorithms computing t. 
Obviously, the stabilizer of a point (i.e., of an algorithm) with respect to this action is the 
automorphism group of a given algorithm. 
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Example. Let U\ — U 2 — U 3 — M 2 2 (K) and t = (2,2,2) e U\®U 2 ® U 3 . In this 
case it was shown by de Groote [21] that T(t), and even T°(t), acts on the set of all optimal 
algorithms transitively, so this set is an orbit. The stabilizer (in the full T(t)) of a point in 
this orbit is nothing else but Aut(«S), the automorphism group of the Strassen algorithm, 
which is isomorphic to S :i x S 3 by [13]. 

Remark. It is natural to consider more general situation, when studying decompositions 
of tensors, as it is done in [20]. Let 7 Z(U) be the set of all decomposable tensors in U = 
£/i<8>.. .®U m . A linear map <p : U —>■ V is called a Segre homomorphism, if (p(lZ(U)) QTZ(V). 
Next, let t E U, let (p : U —* U be a Segre endomorphism such that (p(t) = t, and let A be an 
optimal algorithm computing t. Then ip(A) is also an optimal algorithm computing t. So the 
semigroup of all Segre endomorphisms, preserving t , acts on the set of all optimal algorithms 
computing t. Thus one may think that studying of general Segre endomorphisms may be 
useful in algorithm analysis. However, it was, in fact, shown in [20] that such studying 
completely reduces to consideration of decomposable automorphisms. 

4 The isotropy group of (m, n,p) 

Let A = {ti, ..., t r } be an algorithm computing the tensor t = ( m,n,p ). It was observed 
above that its automorphism group Aut(A) is contained in the (full) isotropy group T(t): 

Aut(A) < T (t). 

So, before studying group Aut(A) for any particular algorithm A, it is natural to find the 
group T (t). This is the aim of the present section. 

In the case m — n — p the group T(t) was already found, independently by Brockett- 
Dobkin, Strassen, and de Groote. Actually, Strassen and de Groote found T(t) in the case 
where t is the structure tensor of a finite-dimensional simple A'-algebra, that is, a matrix 
algebra over a skew held. See the comments after Theorem 3.3 of [20]. 

4.1 Some known facts 

We begin with several standard statements. 

Let V = K l be the space of columns of height l (/ e N) with elements in K , let V' be 

the space of rows of the same length /, and (ei,_, e{) and (e 1 ,..., e l ) be the usual bases 

of V and V' (i.e., e, is the column whose i-th element is 1, the others are equal to 0). Note 
that the rule (v,v’) 1 —» v'v, where v E V and v' E V' , dehnes nondegenerate bilinear map 
V x V' —> K (pairing), and the bases (e^) and (e*) are dual with respect to this pairing 
(note that v'v is an 1 x 1 matrix, i.e., an element of K ). Thus, we may identify V' with V*, 
the dual space of V. 

Denote G = GLi(K). Then G acts on V on the left as usually: (g,v) i-> gv, where gv is 
the usual product of a matrix and a column. Also, there is a left action of G on V' by the 
rule 

(g,v') H- gov' := v'g~ l . 

(This is a left action indeed, that is, (gh) o v' — g o {h o v') for all g,h E G and v' E V'. 
Indeed, g o {ho v') = g o (y'h~ l ) = (u'h -1 )^ -1 = v'h^ 1 g~ 1 = v\gh)~ l = (gh) o v'.) So there 
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is a left action of G on V ® V' — V ® V* such that 

g(y ® v') = gv ® v'g~ x Wg G G, v G V, v' G V'. 

Consider the tensor 

i 

6 = 22 ei ® = 22 Sij e i ®e j eV ®V* 

i=l l<ij<n 

(so-called identity tensor, i.e., the tensor, associated with the identity linear map of V. 
Observe that the coefficients of the tensor <5 are precisely 5^, where 5^ is the Kronecker 
symbol. This justifies using the same symbol 6 both for identity tensor and the Kronecker 
symbol). 

The following lemma is well-known (even trivial; cf. Remark 3 in Subsection 1.3). 
Lemma 4.1 We have g5 = 5, for all g G G. 

Proof. Let a l} and b l3 be the coefficients of the matrices g and <? _1 , i.e., 

g ^ ^ dij^ij and (j ^ ^ bijCij. 

1 <i,j<l 1 <i,j<l 

Then get = , <t j-,( , and e l g~ l = Y2j=i b^e?. Hence 

i i i i i 

9$ = gC22 ei ® e l ) = 22 9e i® el 9^ 1 = 22^22 a ^ ® C2^b ik e k ) 

i =1 i= 1 i =1 j =1 k =1 

l l 

= 22 E a P b ik)ej ®e k = 22 ( S ik) e j ®e k = 22 e i ® e J = 5, 

\<j,k<l i= 1 1 <j,k<l 3 =1 

as EL = S* for all 1 < j,k < l (because matrices g and g 1 are inverse). □ 


Tensor products of group representations. Recall the notion of tensor product of 
representations of a group (see [31], §VIII.7, or [16], § 12). Let F be a field, G be a group, 
and let U and V be FG-modules, that is, F-spaces endowed with F-linear action of G. Let 

T : G —» GL(U) and R : G — > GL(V) 

be the corresponding representations of G on U and V. Put W = U ®f V■ For an element 
g G G define linear map S{g) : W —» W by S{g) = T(g) ® R(g). Since both T{g) and R{g) 
are invertible, S(g) is invertible also. Moreover, 

S(9i92) = T(g l g 2 ) <g> Rigm) = T(g 1 )T(g 2 ) ® R(gi)R(g 2 ) 

= (T(gi) <g> R( gi ))(T(g 2 ) ® R(g 2 )) = S{ 9l )S(g 2 ). 


That is, 

S : 9 ^ S(g ), G —► GL(W) 

is a representation of G. It is called the tensor product of representations T and R (and W 
a tensor product of FG-modules U and V). 

If Ui, U 2 , Vi, and V 2 are FG'-modules, and a : U\ —* U 2 and (3 : V\ —* V 2 are FG- 
homomorphisms, then (as is easy to check) the map 7 = a ® (3 : U\ ® V\ —* U 2 ® V 2 is a 
FG'-homomorphism also. 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


14 


4.2 A subgroup of T°{t) 

Let D = K m , E = K n , and F = K p be the spaces of columns over K of height m, n, and p, 
respectively, and D', E', and F' be the spaces of rows of the same length. By di, 6j, A, d\ 
ef and f k we denote the elements of the usual bases of D ,..., F'. As observed earlier, we 
may identify D', E' and F' with D*, E*, and F*, respectively. 

Note that for any d G D and e' G E' their product del is an m x n matrix. In particular, 
d je - 7 = eij are matrix units, for all 1 < i < m, 1 < j < n. Similarly for ef and fd'. 

Denote M mn = M mn (K), M np and M pm by Li, E 2 , and L 3 , respectively, and let 


L = L x ® L -2 ® L 3 = M mn <g) M np <g) M pm . 

Also denote 


N = D®D'®E®E'®F®F'. 
Next, define the linear map t : N L by the rule 


t : d <g) d' ® e ® e' ® ® /'(->• de' <S> ef ® fd 1 . 


This map is well-defined indeed, because de', ef, and fd' are in L x , L 2 , and L 3 , respectively, 
and, moreover, the expression de' <g) ef ® fd' is linear in each of the arguments d, d ',..., f. 
It is easy to see that r is an isomorphism of vector spaces. 

Let dp = ]T -=i d t <g) dl, dp and dp be the identity tensors of the spaces D, E, and F. 
Consider the tensors dp < 8 > dp <£> dp G N and r(dp ® dp ® dp) G L. 

Lemma 4.2 The equality r(dp <£> dp <S> dp) = ( m,n,p) holds. 

Proof. We have 


r(dp < 8 ) dp <g) dp) 


= T 


( E di ® ® ( E 


e,- <g) e - 1 




i= 1 


1=1 


fc=l 


r( E ^ ® d l ® ej ® e - 7 ® f k ® / fc ) = E die j ® e jf k ® fkdf 


1 <i<m 
l<j<n 
l</c<p 


l<i<m 
1 <j<n 
l</c<p 


E e *i ® ® e fci = (m, n, p). 

l<i<m, !<;<n, l<fc<p 


□ 


Next we consider some group actions. Put G^ = GL m (K), Gp = GL n (K), Gp = 
GL p (K). Then G# acts on D and D', G^ — on E and E', and G^ acts on F and F'. 

Form the direct product G = Gp x Gp x Gf and define actions of G on D,..., F'. For 
example, if g — (g 3 , g 2 , g 3 ) G Gp x Gp x Gf, d G D and dl G D ', then we define g(d) = g 3 d 
and g(d') = d'gf 1 (here g(d) and g(d') mean the result of group action, and g\d and d'gf 1 
mean the products of matrices). That is, the factor Gp of G acts on D and D' as usually, 
whereas Gp and Gp act trivially. The actions on E, E', F, F' are defined similarly. 

Now we can consider N = D ® ... ® F' as a G-module. 
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Further, it is easy to see that the rules 

( 91 , 92 ,9s)x = gixgf 1 , fhxgf 1 , or g 3 xgf l 

define actions of G on L \, L 2 , and L 3 , respectively. Hence we can define action of G on 
L = L\ ® L'2 ® L 3 . 

In the proof of the following lemma (and later) we use the following simple 
Observation. Let K be a field, G a group, X and Y be KG- modules, and let y : X —y Y 
be a K -linear map. b y is a A'G'-modulc homomorphism and a id-spaces isomorphism, then 
y is a KG -module isomorphism (that is, the inverse map y ” 1 : Y —y X is a homomorphism 
of idG'-modules). 

Lemma 4.3 The map r : N —y L, defined above, is a G-module isomorphism. 

Proof. As r is an isomorphism of vector spaces, it remains to check that r is a (S'-module 
homomorphism, that is, r(g(x)) = g(r(x)) for all g G G and x G N. 

It is sufficient to consider x — d ® d' ® e ® e' ® f ® /'. Let g = (gi, g 2 , . 9 . 3 ) • Then 

T(g(x)) = T((g l ,g 2 ,gfi){d®d'®e®e'®f®f')) 

= rigid ® d'gfi 1 ® g 2 e ® e'gf 1 ® g 3 f ® f'gf 1 ) 

= (9id)(e'gf 1 ) ® (^ 2 e)(/ , 9 3 " 1 ) ® {dsDid’gf 1 ) 

= gide'gf 1 ® g&f'gf 1 ® 9 zfd'gf v = {g\,g 2 , gfi){de' ® ef ® fd!) 

= 9 ( T ( X ))- 

(Note that there is an alternative way to prove that r is a G'-homomorphism. Namely, 
observe that the formula x ® y e->■ xy defines G'-homomorphisms a : D ® E' —y Li, fi : 
E ® F' —y L 2 and 7 \ F ® D' —y L 3 . So their product 

a®fi®y:D®E'®E®F'®F®D' —> Li® L 2 ®L 3 

is a G -homomorphism also. Also, the “permutation map” 

C : D ® D' ® E ® E' ® F ® F' —» D ® E' ® E ® F' ® F ® D' 

is obviously a G- ho rno mo r p his rri. Now it remains to observe that r coincides with [a® fi ® 

7) 0 C- □ 

The proof of the following simple lemma is left to the reader. 

Lemma 4.4 Let a G GL m (K), b G GL n (K), and x G M mn (K). If either ax = 0, or xb = 0, 
or axb = 0, then x — 0. So the map y i-G ayb on M mn (K ) is invertible. 

Proposition 4.5 Let L = Li® L 2 ® L 3 be as above. For a G GL m (K), b G GL n (K), and 
c G GL p (K) let T(a, b,c) : L —y L be the linear map defined by 

T(a, b,c)(x ® y ® z) = axK 1 ® bye -1 <g) cza L . 


Then 


H = { T(a,b,c) | (a,b,c) G GL m (K ) x GL n (K) x GL p (K)} 


is a subgroup ofT°(t). 
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Proof. It follows from Lemma 4.4 that T(a,b,c ) is an automorphism of L. It is also easy to 
see that T(a, b, c)T(a\, b\, cf) = T(aai, bbi, cci) for any a,... ,c\. Therefore H is a subgroup 
of S°(L u L 2 ,L 3 ). 

Let D, E, F, ... be as above. The group G = Gp x Ge x Gp = GL m (K ) x GL n (K) x 
GL p (K) preserves Sp G D <g) D', when acting on D <g> D', according to Lemma 4.1. Similarly 
G preserves Sp and Sp, and so preserves 5p<S)5e®Sf- As t \ N —* L is a G'-homomorphism 
and t = t(5e> <8> bp <E> Sp), we see that G preserves t. 

It remains to note that the image of (a, b, c) G Gp x Ge x Gp in GL(L) coincides with 
T(a, b, c ). So T(a, b, c ) preserves t for any a, 6, and c, whence H < T°(t). □ 

4.3 Structure of the full T(t) 

The full isotropy group T(t), where t = ( m,n,p ), may be larger than T°(t). However, the 
relations between T(t) and T°(t) can be easily described. This is the aim of the present 
subsection. 

In the rest of this section we assume that at least one of the numbers m, n, and p is 
different from 1. If this is the case, then at most one of the three spaces L {l L 2 , and L 3 is 
one-dimensional. So for any decomposable automorphism tp G S'(L 1 , L 2l L 3 ) the permutation 
of {Li, L 2 , L 3 }, corresponding to tp, is uniquely determined, by Proposition 3.3. 

First of all, we construct some elements of T(t), not belonging to T°(t). For a permutation 
g of {L\,L 2 , L 3 } we define certain decomposable automorphism p g : L —* L (however, p g 
will be defined not for any triple ( m,n,p )). 

Suppose that m = n. Define /?( 2 3 ) : L —> L by the formula P(pS){x®y® z) = x t ® z t ®y l . 
(Note that we use the same symbol t for the tensor t = (m, n,p) and the transpose map, but 
we hope this will not lead to a confusion). Note that p( 23 ) is well-defined indeed, because the 
formula x H» x l defines an isomorphism of the space L 2 = M np = M mp onto L 3 = M pm , and 
also this formula defines an isomorphism of L 3 onto L 2l and an automorphism of L\. Observe 
next that p^ 23) = 1 (= id L ), as p 2 {23) (x®y®z) = p (23) (p (23) (x <g> y ® z)) = p (23 ) (^ ® ® y l ) = 
(x*)* < 8 ) (yty ® (z t ) t = x ® y ® z. Finally, we have p( 23 ) G T(t), because 

P(23) ( t) = p(23) ( ^2 e ij ® e jk <8) e ki ) = ^2 e P ® e ik ® e k j = t 

(note that if we change names of indices in the latter sum by i —> j, j —» i, k —> k, then 

we obtain the sum for t). 

Similarly, if m — p or n = p, then we may define p^ 2 ) and p(i 3 ) by formulae 

P(i 2 ) {x®y®z) = y 1 ® x f ® z 1 , and 

P(i 3 ) (x®y®z) = z l ® y l <g) x t , 

respectively. 

Next suppose that m — n — p. Then we define P(i 23 ) and P(i 32 ) by the formulae 

P(i 23 ) (x®y®z) = z®x®y, 

resp. 

P(i 3 2 ) (x®y®z) = y ® z ® x. 
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Clearly, pj\ 23 ) = P(i 32 ) and pf 123) = 1- Also, it is easy to see that P(i 23 ) £ r(t). 

Finally, for any triple m, n, p define p e = id/,. 

Observe that the permutation of the factors L 3 , L 2 , and L 3 , corresponding to p g , is 
precisely g. Hence p g ^ 1 if g ^ 1, and also p g ^ ph , if g ^ h. 

Let Q be the set of all p g , that are defined, for given m, n, and p. Thus, 


( 


Q= < 


{Pe = 1}, 

{1,P(23)}, 

{1,P(12)}, 

{1, P(13)}, 

{ 1 , P(12) , P(13), P( 23), P(123) > P(132) }, 


if m ^ n ^ p 7 ^ m, 
if m — n 7 ^ p, 
if m = p n, 
if n — p ^ m, 
if m — n — p. 


Lemma 4.6 For any m, n, and p the set Q is a subgroup ofY(t). Let R < S 3 be the group 
of all permutations of {L 1? L 2 , L 3 }, preserving dimensions. Then the rule g yy p g defines 
isomorphism R -B- Q. Thus, Q = S 3 , Z 2 , or 1, when \{m, n,p}\ = l, 2, or 3, respectively. 


Proof. First suppose that m, n, and p are pairwise distinct. Then the numbers dim L\ = mn , 
dim L 2 = np, and dim L 3 = pm are pairwise distinct also, whence R — 1. Thus, in this case 
both Q and R are trivial groups, and the statement is trivial too. 

Further suppose that \{m,n,p}\ = 2. We consider, as an example, the case m — n ^ p 
only. In this case Q = {p e = 1,P(2 3 )}- Moreover, dimLi ^ dimL 2 = dimL 3 , whence 
R = {e, (23)}. As p^ = 1 and p( 2 3 ) ^ 1, we see that Q is a group isomorphic to Z 2 , and 
that the Injection e -H- p e = 1, (23) -H- p( 2 3 ) is an isomorphism between R and Q. 

Finally consider the case m — n — p. In this case mn = np = pm, whence R = S 3 
consists of all permutations of {L \, L 2 , L 3 }. Moreover, all p g are pairwise distinct, and any 
p g is an automorphism of L. So it is sufficient to prove that p g pu = p g h for each pair of 
9 , h G S 3 . 

It is not hard to check the latter equality in all cases directly. If g = e or h = e, then 
this equality is trivial, as p e = 1. It remains to check this equality for 25 pairs (g, h) with 
g,h e, which is not too many. 

There is, however, a shorter argument. For g G S 3 let n g be the usual permutation of 
factors, for example 7T(i 3 )(o; <g) y <g) z) = z ® y ® x. Then n g 7 Th = n g h for any g and h. Next, 
let r : L — y L be the componentwise transpose map, i.e., r{x ®y®z) = x k ® y t ® z f . Note 
that r commutes with all x g , and also that r 2 = 1. Next, note that p g = Ti g if g is even, and 
p g = Tn g if g is odd. In other words, p g = 7 T g T £ ^ 9 f where £ : S 3 —> Z 2 = (0,1} is the parity 
homomorphism. Now for any g and h we have p g ph = XgT^XhJ 5 ^ = n g nhT £ ^T £ ^ = 
7T gh T £ ^ +£ ^ = 7T g hT £ ^ = p g h, as required. □ 


To state the next proposition it is convenient to use the notion of semi direct product. 
Recall that a group G is the product of its subgroups A and B , which is denoted by 
G = AB, if for each g G G there exist a G A and b G B such that g = ab. If in addition 
A fl B — 1, then it is easy to see that the representation of g in the form g = ab is unique. 

A group G is said to be a semidirect product of A by B, which is denoted by G = A A B, 
if G = AB, A is normal in G, and A D B — 1. 

Proposition 4.7 Lett = (m,n,p), and let Q < F(t) be the subgroup described above. Then 

r(t) = r°(t) x q. 
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Proof. We know that r°(t) < T(f). Further, Qnr°(i) = 1, because a nontrivial element of Q 
corresponds to a nontrivial permutation of L i, L 2 , A 3 . It remains to show that T(t) = r°(t)Q. 
Let x G T(t), and let g be the permutation of L\, L 2 , A 3 , corresponding to x. Then g preserves 
the dimensions of factors, and therefore p g is well-defined and p g G Q. Since the permutation 
of factors, corresponding to p g , is g, it follows that the permutation of factors, corresponding 
to the element x' = xpf 1 , is trivial, that is, x' G r°(f). Thus, we have x = x'p g , where 
x' G T°(t) and p g G Q. Hence T(£) = T°{t)Q. □ 

Proposition 4.8 Let T(a,b,c ) and H be the transformations and the group introduced in 
Proposition f.5. Then any element g G T°(t) has the form g = T(a, b, c ), for some a, b, and 
c. Therefore, T°(t) = H. 

This proposition will be proved later in this section. 

Next we describe the group T°(t) as an abstract group. 

Recall that the projective general linear group PGL n (K) is the quotient group 

PGL n {K) = GL n (K)/Z n (K), 

where Z n (K) = {A E n | A G A'*} is the subgroup of all nonzero scalar matrices. 

(For reader’s information we recall the following standard facts on linear groups; they 
can be found in many textbooks, see for instance [17] (§§I.l, 1.2, II. 1, II.2), [28] (§13), and 
[43] (§1.9). 

The group Z n (K ) is the center of GL n (K). The group PGL n (K ) contains the projective 
special linear group PSL n (K) = SL n (K)/(Z n (K ) D SL n (K )) as a normal subgroup. The 
latter group is simple, except for the two cases ( n,K) = (2,F 2 ), (2, F 3 ). The quotient 
PGL n fK)/PSL n (K) is trivial, if K is algebraically closed, and is a finite cyclic group if K 
is finite.) 

Let (p : GL m (K ) x GL n (K ) x GL p (K ) —» r°(t) be the map defined by tp((a,b,c)) = 
T(a, b, c ). It was observed in the proof of Proposition 4.5 that ip is a group homomorphism. 
Proposition 4.8 shows that p is surjective. Therefore, in order to describe its image Irn p = 
H = r°(t) (as an abstract group, i.e., up to isomorphism), it is sufficient to know Ker<^, its 
kernel. 

We need a lemma. 

Lemma 4.9 Suppose A G GL m (K) and B G GL n (K ) be matrices such that AxB is propor¬ 
tional to x for all matrices x G M mn (K). Then A and B are scalar matrices. 

Proof. Consider the map a : x 1 —> AxB of M mn (K) to itself. It follows from Lemma 4.4 that 
a is an isomorphism. So a is a scalar map by Lemma 3.1 : AxB = cx, for a fixed c G K*. 
Next, let aij and bij be the coefficients of A and B , respectively: 

m n 

A = ^ ^ e ij 1 ^ij e ij ■ 

i,j=l i,j= 1 

Then for all p and q such that 1 < p < m and 1 < q < n we have 

C€pq ACpqB ^ ^ aipbqjCij , 

l< 2 <m, l<j<n 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


19 


whence a pp b qq = c, and di P b q j — 0 if i ^ p or j q. The former of these relations implies that 
a pp , b qq ^ 0 for all p and q. Taking i = p, j q in the second relation, we obtain a pp b q j = 0, 
whence b q j = 0. Similarly, we obtain a* p = 0 for i p. Thus, A and B are diagonal matrices. 
Next, for any q we have anb qq = c, whence b qq = c/an, so B is a scalar matrix. Similarly, A 
is a scalar matrix also. □ 


Proposition 4.10 The kernel Ker <p coincides with Z m {K) x Z n {K ) x Z p (K), and therefore 
the group T°(t) = H is isomorphic to PGL m (K ) x PGL n (K ) x PGL p (K). 

Proof. Let (a, b , c) = (. XE m , pE n , vE p ), where A, /i, v G K*, be an element of IV = Z m (K ) x 
Z n (K) x Z p (K). Then for any x G Li, y G L 2 and z G L 3 we have 

T(a, b,c)(x <g> y ® z) = XxpT 1 ® pyv~ l <£> vz A -1 = x ® y ® z, 

so T(a,b,c) = 1. Hence N < Ker <p. 

Conversely, suppose that (a, 6, c) G Ker <p. Then T(a, b,c)(x ® y ® z) = x ® y ® z for all 
x, y, z, that is, axb _1 <g) fepc -1 <g) cza” 1 = x ® y ® z. Hence axb _1 is proportional to x by 
Lemma 3.2. So both a and b are scalar matrices by Lemma 4.9. The matrix c is scalar also 
by a similar argument, whence (a, b, c) G N. Therefore Ker <p < N. □ 


It may be useful to have explicit formulae for conjugation of an element of Ef by an 
element of Q (however, we will not use these formulae in the present work). 

For a matrix x G GLfK) we denote by x v the matrix x v = (x*)” 1 = (x” 1 )* (which is 
usually called the matrix contragradient to x). 

Proposition 4.11 The following relations hold: 

p e T(a,b,c)pf 1 = T(a,b,c), 

P(i 2 )T(a, b, c)p" 2) (= p ( i 2 )T(a, b, c)p ( 12) ) = T(c w , 6 V , a v ), 

P(i 3) T(a, b, c)p { is) = T(a v , c v , 6 V ), 

P(23)T(a, b, c)p (23 ) = T(6 V , a v , c v ), 

P{yzyfff , b, t')P( i23) bf 

P(i 32 )T(a, 6, c)p(j 3 2) = T(6, c, a). 

Proof. The hrst relation is trivial, because p e = 1. Prove the next relation, as an example. 
Note that p(j 2 ) = P(i 2 ), as p^ 12) = 1. 

For x G Li, y G L 2 , and z E L 3 we have p(i 2 )(x ® y ® z) — y t ® x f ® z f , whence 


x ® y ® z 


P \ h -> ) ®x t ® z t T eG ^ 1 0 1 

(fa^c- 1 )* ® (a^r 1 )* ® (c^a -1 )* = 
= c v x(6 v ) -1 ® 6 v p(a v ) _1 ® a v 2 ;(c v )' 1 


® c/a 1 

(c-'fxtf <g> ib^fya! <g> (a _1 )^c* 
= T(c v , 6 V , a v )(x ®y®z ), 


whence 


p ( i 2 )T(a, 6 , c)p ( i 2 ) = T(c v , 6 V , a v ). 
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The other relations can be proved similarly. □ 

We summarize the results of this section in the following theorem. For the convenience 
of the future usage, we state this theorem “in full”. 

Theorem 4.12 Let m,n,p G N, ( m,n,p ) (1,1,1), let L\ = M mn = M mn (K), L - 2 = M np , 

L 3 = M pm , let L = Li <g) L -2 ® L 3 , and let 


t = {m, n,p) 



e-ij t r /j eki 


l<i<m, l<j><n, 1 <k<p 


e 


L. 


For elements a G GL m (K), b G GL n (K), c G GL p (K ) define transformation T(a,b,c ) : 
L —» L by the formula 


T(a,b,c)(x <g) y <E> z) = axb 1 ®byc 1 <g) cza 


Put 

H = { T(a,b,c ) | ( a,b,c ) G GL m (K) x GL n (K) x GL p (/i)}. 

T/?,en r°(t) = H. The transformations T(a,b,c ) and T(ai,6i,ci) coincide if and only if 
Oi = Aa, 6 i = pb, and c± = uc, for some X,p,u G IF*. The group H is isomorphic to 
PGL m (K ) x PGL n (K) x PGL p (K). 

For any element g G S 3 and some m, n, p define transformation p g : L —* L as follows: 
p e = 1 = idi, /or any m, n, p; 

P( 23 ) {x ®y ® z) = x 1 ® z l ® y f , when m = n; 

P(i 3 ) (x ® z) = z* ®y f ® x l , when n = p; 

P( 12 ) (x ® y ® z) — y 1 ® x t (g) 2 /, w/ien m = p; 

P( 123 ) (x <g) y ® z) = z ® x ® y and p(i 32 ) (x ®y ® z) = y ® z ® x, when m — n — p. 

Put 

Q = {Pg | <7 G 5*3 , p s is well-defined}. 

In other words, 


Q 


{Pe = 1}, 

{1,P(23)}, 

< {1, P(12)}, 

{1, P(13)}, 

k {1; P(12), P(13), P(23) i P(123) 1 P(132) }, 


if m n p m, 
if rn — n p, 
if rri — p n, 
if n = p m, 
if m — n — p. 


Then Q is a subgroup ofT(t), isomorphic to S 3 , Z 2 , or 1, when \{m, n,p}\ = 1, 2, or 3, 
respectively. 

The group T(t) is a semidirect product ofT°(t) and Q: 


The following relations, describing action of Q on H by conjugation, hold (in all cases where 
p g is well-defined): 


p e T(a,b,c)p e 1 = T(a,b,c), 
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p ( i 2 )T(a, b, c)pf 1 2 } (= p(i 2 )T(a, 6, c)p ( i 2)y ) = T(c v , 6 V , a v ), 

P(i 3) T(a, 6, c)p(i 3) = T(a v , c v , 6 V ), 
p ( 23)T(a, b, c)p(23) = T i bV , a V , c v ), 

P(123) b , c)p^23) 

P(i 32 )T(a, b, c)p~ 1 3 2) = T(6, c, a). 

This theorem is an immediate consequence of Propositions 4.5, 4.8, 4.7, 4.10, and 4.11. 

The aim, of the rest of this section is to prove Proposition f.8. 

4.4 Structure tensors and contragradient maps 

In this subsection we recall some well-known notions. 

Let V be a space and V* be its dual. For two elements v G V and l G V* it will be 
convenient to denote l{v) either by (l,v) or by (v,l). Thus, the element (ui,u 2 ) is defined, if 
one of elements u\ and u 2 is in V, the other is in V *; and we always have (ui,u 2 ) = {u 2 , uf). 
The symbol (iq, u 2 ) is called the pairing of U\ and u 2 . 

Let / : A — y Y be a linear map. The map f* : Y* — y X*, taking an element l G Y* 
to the element m = f*(l) G A"*, defined by m(x) = l(f(x)), is linear and is called the map, 
dual to /. Thus, f* is the unique map such that 

(l,f(x)) = (r(l),x), VxgA, leY*. 

For any two maps / : X — y Y and g : Y — y Z the equality (gf)* = f*g* holds. 

Suppose / : X — y Y is an isomorphism. It is easy to see that f* : Y* — y X* is an 
isomorphism also. The inverse isomorphism (/*) -1 : A"* — y Y* is called an isomorphism, 
contragradient to /, and is denoted by / or ff It will be convenient for us to denote it 
by / v . This isomorphism can be described as the unique isomorphism / v : A"* — y Y* such 
that 

(r(l)J(x)) = (l,x) V xex, lex*. 

It is easy to see that (p/) v = fi ,v / v for any two isomorphisms / : X — y Y and g : Y — y 
Z. Also, (/ -1 ) v = (/ v )^ 1 . In particular, suppose that : G — y GL{X) is a representation 
of a group G on a space X. Then the map <p* : G — y GL(X*), defined by < p*(g) = <p(g) v , is 
a representation of G also. It is called a representation contragradient (or more often dual ) 
to (p. (Thus, the usage of the word “dual” for linear maps and for group representations is 
somewhat different). 

Finally note that taking contragradient map is involutive, that is, (/ v ) v = / for any 
isomorphism / : X — y Y. (Strictly speaking, (/ v ) v is a map from (A*)* to (A*)*, but we 
can identify (V*)* with V, because we consider only finite-dimensional spaces.) 

Let X, Y, Z be spaces. By jC(X, Y) we denote the space of all linear maps from X to 
Y , and by C 2 {X, Y ; Z) the space of all bilinear maps / : X xY — y Z. The spaces C{X, Y) 
and £ 2 (Ai, Y; Z) may be identified, in a canonical way, with A* ® Y and A* ® Y* <g) Z, 
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respectively (see [32], §4.2). Recall the description of this identification. Let l G A"* and 
y G Y. Consider the map pi iV : X —y Y, defined by 

<Pi,tv( x ) = K x )v- 

Clearly, pi :V is a linear map. Furthermore, the expression l(x)y is linear in all three arguments 
/, x, and y , and therefore the rule (l,y) i-> p^ y defines a bilinear map from A"* x Y to 
£(A, Y). By the universal property of tensor product there exists a unique linear map 
y> : A* <g) Y —* £(A", Y) such that p{l ® y) — p^ y for all l and y. 

Show that this p is an isomorphism. Let e 3 ,..., e m and f\ be bases of X and Y, 

respectively, and e 1 ,.. .., e m be the basis of A"* dual to (e*). Then {e 1 ® fj | 1 < i < m, 1 < 
j < n} is a basis of A* <g) Y. Put h t] = p(e l ® fj). It is easy to see that h t] is the linear map 
that takes e* to fj and takes ei to 0 for all l i. Clearly, { h t] \ i,j} is a basis of £(A, Y). 
Thus, p takes a basis of A *® Y to a basis of £( A, Y), and is therefore an isomorphism. The 
map y> is called the canonical isomorphism between A* ® Y and £(A, Y). 

We can define the isomorphism p : X* <g) Y* <g) Z — y C 2 (X,Y', Z) in a similar way. 
Namely, p is the unique linear map such that 

(<p(l <g>m<8> z))(x, y) = l(x)m(y)z V x G A, y E Y,z G Z, l G A*, m G Y* 

(the details are left to the reader). 

Let / G £(A, Y) (resp., / G £ 2 (A, Y ; Z)), and let h G A* ® A (resp., h G A* ®Y* ® Z) 
be a tensor such that <p(h) = /. This h is called the structure tensor of /, and will be 
denoted by /. 

Consider the group G = GL(X) x GL(Y). It acts on the spaces A* <g) Y and £(A, Y) as 
usually. That is, an element g = (gi, g 2 ) G G acts on A* <g) Y as gf ® g 2 , and the action of g 
on £(A, Y) is defined by g(f) = g 2 fgf 1 (we leave to the reader to show that this is indeed 
a left action). Similarly, the group G = GL{ A) x GL(Y) x GL(Z) acts on A"* <g) Y* <g) Z and 
on £ 2 (A, Y; Z). The element g = (g \, g 2 , gf) G G acts on A* ®Y* ® Z as gf ® gf g^, and 
the action on £ 2 (A, Y; Z) is described by the rule 

W))(x, y) = g 3 (f(gf\x),gf\y))) 

(i.e., g takes / to the map fi defined by fi(x,y) = g 3 (f (gf 1 (x), gf 1 (y)))] we may also write 
this as g(f) = g 3 o f o (gf 1 x gf 1 )). 

Proposition 4.13 Let G = GL( A) x GL(Y) (resp. G = GL{ A) x GL(Y ) x GL(Z)), and 
let (p : X* <S> Y —> £(A, Y) (^resp. p : X* ® Y* <g) Z —> £ 2 (A, Y\Z)) be the canonical 
isomorphism. Then p is an isomorphism of KG-modules. 

Proof. It was observed above that p is an isomorphism of vector spaces. By the observation 
preceding Lemma 4.3 it is sufficient to check that p is a homomorphism of AG-modules. 
We prove this statement only for p : X* <g) Y — y £(A, Y), leaving the second statement to 
the reader. 

We have to check that g(p(u)) = p(g(u)) for all g = (gi,g 2 ) £ G and u G A* ® Y. 
By linearity, we may assume that u — l ® y. The condition g(p(u)) = p(g(u)) means that 
(g(p(u)))(x) = ( p(g(u)))(x ) for all x G A. Thus, we have to show that 


{(di,92)(v(l ® y)))(x) = (p((gi, g 2 ){l ® y)))(x) 


(3) 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


23 


for all , 9l G GL{X), g 2 G GL(Y), l G X*, y G Y, and x G X. 

We have ((g^ g 2 )(ip{l®y)))(x) = {{gi, g 2 ){<pi, v )){x) (by the definition of ip) = g 2 (<pi, y (gf 1 x)) 
(by the definition of the action of G on C(X,Y)) = g 2 (l(gf 1 x)y) (by the definition of (pi jV ) 
= l(gf 1 x)g 2 (y) (because g 2 is linear, and l(gf l x) is an element of K). 

On the other hand, (</?((<q, g 2 )(l <8> y)))(x) = ((p(gfl ® g 2 y))(x) (by the definition of the 
action of G on X*®Y) = < p g \i my {x) (by the definition of ip) = (gfl)(x) ■g 2 y (by the definition 
of <pi, y ). Further, note that (^Z)(x) = {gfl,x) = {gfl, g^gf'x)) (as x = g^gf'x)) = (l,gf x x) 
(by the property of contragradient maps) = ^g^x). Hence (gfl)(x) ■ g 2 y = l(gf l x) ■ g 2 y. 

Thus, both the left-hand and right-hand sides of (3) are equal to l(gf l x) ■ g 2 y , and 
therefore (3) is true. □ 


4.5 The isotropy group of a bilinear map 

Let X, Y, and Z be vector spaces and let / G C 2 (X, Y; Z ) be a bilinear map. The group 
G = GL(X) x GL{Y ) x GL(Z) acts on C 2 (X,Y ; Z) in the way described in the previous 
subsection. The stabilizer of / in G with respect to this action will be called the isotropy 
group of /, and will be denoted by A(/). The reader can easily check that this definition is 
equivalent to the following: A(/) is the set of all triples (A, B, C) G G such that f(Ax, By) = 
Cf(x,y) for all x G X and y G Y. In other words, the diagram 


X x Y — f —> Z 


AxB 


C 


X x Y —Z 


must commute. 

Example. Let A", Y, and Z be three spaces, let U = C(X, Y), V = C(Y,Z), W = 
jC(X, Z), and let / : U x V —> W be the usual composition of mappings, i.e., f(x,y) = yx. 
Clearly / is bilinear. For g = (tq, g 2 , g 3 ) G GL(X) x GL{Y) x GL(Z) put R(g) = (A, B, C), 
where A : U —» U , B : V —>• V , and C : W —> W are defined by the rules Ax = g 2 xgf x , 
Bx = fiqaxqT 1 , and Cx = g 3 xgf l , respectively. Then it is easy to see that R(g) G A(/) for 
all g. Moreover, R : g hg R(g) is a group homomorphism. Later we will show that R is an 
epimorphism. 

The following proposition shows that the isotropy group of a bilinear map is closely 
related to the (small) isotropy group of the structure tensor of this map. 

Proposition 4.14 Let f : X x Y —» Z be a bilinear map and let f G A"* <g> Y* <g) Z be its 
structure tensor. Let ( A,B,C) G GL(X) x GL(Y) x GL(Z). Then ( A,B,C) G A(/) if and 
only if A v ®B y ®C G r°(/). 

Proof. By Proposition 4.13, the map h hg h is a G'-isomorphism from C, 2 {X,Y\Z) to 
X* ®: Y* (g) z, where G = GL{X) x GL(Y) x GL(Z). So g = (A, B,C) G G is in A(/) if 
and only if g fixes /. But g(f) = (A v ® B y ® C)f by the definition of the action of G on 
A* ® P* ® Z. □ 
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4.6 Type of a space of linear maps 

Let X and Y be vector spaces, dim X = m and dim Y = n. Let £ = £(X, Y), and L C £ 
be some space of linear maps from X to Y. It is natural to call spaces 

KerL = p| Ker/ = {x G X \ f(x) =0 V / G L} 

feL 


and 

Im L = E Im/ 

feL 

the kernel and image of L , respectively. 

Put Pi(L) = m — dim Ker L and /3 2 (L) = dim Im L. The pair /3(L) = (/?i(L), (d 2 (L)) will 
be called the type of L. 

Note that for any linear map / € £(X,Y) we have dim Ker / + dimlm/ = m, so for a 
one-dimensional subspace (/) C £(X,Y) we have /?i((/)) = fadf)) = dimlm/ = rk(/). 
Thus, /3(L) is a generalization of the rank of a linear map. 

Let / G £(X, Y), and let a G GL(X) and b G GL{Y ) be automorphisms of X and Y, 
respectively. Then bfa G £(X, Y). It is easy to see that 

Ker 6/a = a' 1 (Ker /), and Im bfa = 6(Im /). 

Hence, if r afc : £ —> £ is a linear transformation defined by T a ^(f) = bfa, then 

Ker r a fi(L) = a^ 1 (Ker£) and Imr ab (£) = b(ImL), 

for any subspace L C £. It follows that r a) 6 preserves the type: 

P(T a , b (L)) — f3(L) V£C£. 

The aim of this subsection is to prove the converse statement. 

Proposition 4.15 Let £ = £(X,Y), and let h : £ —» £ be a linear transformation such 
that 1 3(h(L )) = /?(£) for all subspaces L C £. Then there exist a G GL(X) and b G GL(Y) 
such that h = r a fe . 

Proof. It is useful to observe that the set of all transformations of the form T a j, is a group, 
because r a yr c4 = r acM , and r idxMY = id c(x,y), whence also (r ajb ) _1 = r a - i ) 6 -i. It is obvious 
that the group 

{Ta,b | a G GL(X), b G GL(y)} 

is the image of GL(X) x GL(Y) under the representation of the latter group on £{X,Y), 
described in Subsection 4.4. 

Take bases (ei,..., e m ) and (d\ ,..., d n ) of X and Y, respectively. Moreover, let (e 1 ,..., e m ) 
be the basis of X* dual to (ef). Let f l3 : X —* Y be the map defined by /p/ej) = d 3 , 
fij( e i ) = 0 when l ^ i. Then {/^ | l<i<m,l<j< n} is a basis of £. Moreover, 
fij = < fei^dj , where for / G X* and y G K 


<#,j/ = (p(l<8>y) :x ey l(x)y 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


25 


is the linear map, described in Subsection 4.4. 

It was noted above that ft((f)) = (rk(/), rk(/)) for any / G C(X,Y). So h preserves the 
rank: rk (h(f)) = rk(/) for all / G £. 

Put qij = h(fij ). As fij are of rank 1, and {/p} is a basis of £, it follows that all q VJ are 
also of rank 1 , and 

{q^ | 1 < i < m, 1 < j < n} 

is a basis of £. 

It is easy to see that the maps / G £ of rank 1 are precisely the maps of the form ip^ y . 
In particular, there exist Uj G X* and y tJ G Y such that q l3 = tpi ijmj for all 1 < i < m, 
1 < j < n. 

Show that lij 1 and l h j 2 G X* are proportional, for any i, ji , and jy in other words, 
the line (l tJ ) C X* depends only on i. Consider the space L = (fi.j 1 , fi,j 2 ). Obviously, 
KerL = (e/ | l i) has codimension 1 in A"*, so the kernel of h(L) = (q i j 1 ,qij 2 ) must be 
of codimension 1 also. Now observe that Ker(pi ty = Ker/ and limply = ( y ), for any l G A"* 
and y E Y such that l,y ^ 0. In particular, Ker q^ = Ker /^ and Irn q t] = ( y iy ), for all i 
and j. Hence Ker (q i j 1 ,qij 2 ) = Ker l l ]l fl Ker l h j 2 . If and l h]2 are not proportional, then 
the latter intersection is a subspace of codimension 2 in A"*, a contradiction. Thus, l t)]l and 
lij 2 are proportional. Therefore, there exist h,...,l m G X* such that (hj) = (f), for all i 
and j. 

Similarly, one can show that (y y ) depends only on j. Indeed, for given £ i 2 , 1 < 

*i,* 2 < m, consider the space L = fi 2 ,j)- Its image is ( dj ). So the image of the space 

(q U)V q l 2 : j) = h(L ) is of dimension 1 also, whence (y^y) = (yi 2 ,j), whence (y t j) = (yj), for 
some yi,... ,y n G Y. 

Thus, we see that qij = = Kj^u, yv for some \ l3 G I<*. 

Show that (yi, ..., y n ) is a basis of Y. Otherwise, (yi ,..., y n ) = Y' is a proper subspace 
of Y. But the image of any q^ lies in Yf so the image of the whole £( X, Y) = (q l3 \ 1 < i < 
m, 1 < j < n) also lies in Y', which is impossible when Y' Y. 

Similarly, l m ) is a basis of X*. Indeed, if this is not the case, then there exists an 

element x G X, x ^ 0, annihilated by all q^, i.e., by the whole £(AT, Y). 

Further, we “normalize” basis {q^} in an appropriate way. To do this, we need to know 
how r a) fc acts on <p^ y . Prove that 


T a ,b(Vl,y) ~ Ya*{l),b{y) , ( 4 ) 

where a* G GL(X *) is the map dual to a. Indeed, for all x G X we have (T a ^(p>^ y ))(x) = 
(bp>i iy a)(x) = b(tpi'y(a(x ))) = b(l(a(x))y ) = l(a(x))b(y). Since l(a(x )) = ( a*(l))(x) by the 
definition of the dual map, we finally get 

(T a ,b(vi,y))(x) = (a*(l))(x)b(y) = <p a *(i),b(y )( x )» 

which proves formula (4). Let b\ G GL{Y ) be the element taking basis (yi, ■ ■ ■ ,y n ) to 
(di ,..., d n ), let c G GL(X*) takes (£,..., l m ) to (e 1 ,..., e m ), and a\ = c* G GL(X ) be the 
map dual to c. Then r aubl takes tp tuyj to (paiqpMvj) = Vefdj = fij■ As Qij = ^ijYh^j, we 
see that the map h' = T ai ^ bl h takes f t j to \jfij- Moreover, h' preserves types of subspaces. 
Thus, it remains to show that a map h taking f tJ to Kjfij, where \j G K*, and preserving 
the types, must have the form r a y. 
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Multiplying h' by a scalar we may assume that An = 1. Put fi t = Xu, i/j = Ay (whence 
pi — ui — 1), and show that Ay = /ipy,. This is evident if i = 1 or j = 1, so suppose i,j > 1. 
Consider / = / n + fn + /y + /y G £, then h\f ) = / n + gifn + z/,-/y + Ay/y. As rk(/) = 1, 
we must have rk(/r'(/)) = 1, which is the case only if Ay = /ipy,. 

Let a and 6 be the linear maps on A" and K such that a(e;) = /qe* and &(o/) = z/jdj. 
Then it is easy to see that 6/y-a = Hi u jfij — \jfij — h'(/y), for all i and j, and therefore 
h' = T a:b . □ 

4.7 Further lemmas on linear maps 

Lemma 4.16 Let U and V be spaces, ■ U V be linear maps such that 

fl” = jKer f t = 0. Then for any linear function l : U K there exist linear functions 
h : V —* K such that l = Ym=i h ° fi ■ 

Proof. Let T C U* be the set of all linear functions of the form ]C" =1 l t o f,, for some U G V*. 
Clearly, T C U* is a subspace. To prove that T — U* it is sufficient to show that there is no 
a non-zero vector u E U such that t[u) = 0 for all t G T. 

Let u G U, u ^ 0. As fl” =1 Ker /* = 0, there exist i such that /,:(«) ^ 0. Next, there 
exist a linear function m G V* such that m(fi(u)) ^ 0. Then m' — m o f, G T and 
m'{u) = (m o fi)(u) = m(fi(u )) ^0. □ 

Lemma 4.17 Let U, V, and W be three spaces, fi ..., f n G C(U,V) arid f G C(U,W). 
Then f can be represented in the form f = g\f\ + ... + g n f n , for some g, G C(V, W), if and 
only if n” = , Ker f, C Ker/. 

Proof. Denote fl” = , Ker f t by S. It is clear that if / can be represented in the form / = 
gifi + ... + g n f n , then S C Ker/. We need to prove the converse implication. 

First assume that S — 0 and dim IK = 1. Then we may identify W with K , so that / 
and gt became linear functions on U and V, respectively, and the desired statement follows 
from Lemma 4.16. 

Next consider the case when S — 0, but W may be of arbitrary dimension. Let 
(wi ,..., w m ) be a basis of W. Then / decomposes as 

m 

/ = ]//», 

3 =1 

where f® G C(U, (■ Wj )). According to the case A = 0 and dim IK = 1, there exist linear 
maps gf^ : V (wj), where 1 < i < n, 1 < j < m, such that f^ = Y^i=i f% f° r 
j = 1,..., m. Now it is sufficient to take g % = J2"j'=i tiP■ Thus the case S = 0 is settled. 

Finally consider the general case, when S may be nontrivial. Put U = U/S, and let 
cp : U —> U be the canonical factorization mapping. As / and all fi vanish on S, there exist 
unique maps / G C(U, W) and f { G C(U, V ) such that / = ftp and fi = f^. It is clear that 
Ker/ = (Ker /)/S and Ker f i = (Ker ff)/S. 
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As n” =1 Ker/j = S, we have fl" =1 Ker f i = 0. Applying the previous case we see that 
/ = X/"=i 9i.fi f° r some linear maps gi : F — > W. Hence 

n n n 

f = f<P = (X SifiW = X = X 9i f* ■ 

i =1 i= 1 i=l 


□ 


The next lemma is, in a sense, dual to the previous one. 

Lemma 4.18 Let U, V arid W be three spaces, G C(V,W), and f G £(U,W). 

Then f can be represented in the form f = Ym=\ fi9i> f or some 9i £ £(U, V), if and only if 

I m / C ElLi Im /i- 

Proof. Denote Im/i by S'. It is clear that if / can be represented as / = J2'i=ifi9o 
then Im / C S'. Prove the converse implication. Suppose that Im / C S. There exists a 
basis wi ,..., w rn of S such that each Wj has the form Wj = f p (vj), for some p = p(j) and 
Vj G V. Next, let lj : U K be the (uniquely defined) linear functions such that 

m 

/(«) = 

3 =1 

for all u G U. Now for j = 1 ,... , m define linear maps hj : U —> V by hj(u) = lj{u)vj. Then 

mm m m 

f ( u ) = X*>K- = = Xte(^(“)^) = 

i=i j=i i=i j=i 

for all u G U. That is, / = )T)X fp(j)hj■ But 

m n 

X! fpu) hj = X] f i9i ’ 

j=l *=1 

where 9i = E {j \ p{j) = i} hj • □ 

It is convenient to rewrite results of Lemmas 4.17 and 4.18 in a slightly different form, 
using the notions of kernel and image of a space of linear maps. 

For subspaces A C C(U, V ) and £? C £(V, IF) define subspace BA C £(U, W ) by 

HA = (ba | a G A, b G H). 

In particular, we can consider subspaces £(V, W)A and BC{U, , F) of £(f/, IF). 

Proposition 4.19 Let U, V, and W be three spaces, A C C(U,V) and B C C(V,W) be 
subspaces, and let f G C(U, W). Then the following statements hold. 

1) f G £(F, IF) A if and only if Ker / D Ker A. 

2) f E BjC(U , F) if and only if Im / C Im B. 

We leave to the reader to deduce statements 1) and 2) from Lemmas 4.17 and 4.18, 
respectively. 
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4.8 The isotropy group of the map of composition of linear maps 

Let U, V and W be three vector spaces, and let 


<p : C(U, V ) x C{V, W) ->• C(U, W ) 

be the usual composition of mappings, i.e., (p(x,y) = yx. The aim of the present subsection 
is to find the isotropy group A(</?). 

For subspaces A C jC(U, V) and B C C(V, W) we define their annihilators by 

ann(H) = {h £ £(V, W) \ hA = 0} 


and 

ann (B) = {h £ C{U, V ) | Bh = 0}, 


respectively. 

For a subspace L C C(U,V) or L C £(V, W) let j3(L) 
described in Subsection 4.6. 

We need a lemma. 


(j3i(L), faiL)) be its type, as 


Lemma 4.20 For subspaces A C C(U, V) and B C C{V, W) the following equalities hold: 

(1) dim C{y, W)A = p!(A) dim W; 

(2) dim ann(4) = (dim V — P 2 (A)) dim IF; 

(3) dim BC(U, V ) = p 2 (B) dim U; 

(4) dimarm(L)) = (dim V — Pi(B)) dim U. 

Proof. (1) Let S = KerA. By statement 1) of Proposition 4.19 C(V,W)A consists of 
all h £ C(U,W) such that h(S) = 0. Hence dim C{V,W)A = (dim U — dim S) dim W = 
P\{A) dim W. 

(2) Let T = 1m A. Consider / £ C(V, W). If /(T) = 0, then f A = 0, that is, / £ ann(H). 
On the other hand, if f(T) ^ 0; then there exist u £ U and h £ A such that f{h{u )) ^ 0, 
whence ( fh){u ) ^ 0, and therefore fh ^ 0 and / f ann(H). Thus, / £ ann(H) if and only 
if f(T) = 0. Hence dim ariri(H) = (dim V — dim T) dim W = (dim V — P 2 (A)) dim W. 

(3) Let T = Im B. By statement 2) of Proposition 4.19 an element / G C(U,W) 
is in BC(U,V) if and only if Im/ C T, whence BjC(U,V) = C(U,T), and therefore 
dim BC(U,V) = (dim U) (dim T) = (dim U)p 2 {B). 

(4) It is easy to see that ann(H) consists of all / £ C{U , V) such that Im / C Ker B. Hence 
ann(H) = C(U, Ker B) and dim arm(H) = (dim U) (dim Ker B) = (dim V — Pi(B)) dim U. □ 


We also need the following lemma, whose proof is left to the reader. 

Lemma 4.21 Let U, V, and W be three spaces, and let a,b £ C{V, V ) be maps such that 
yax = ybx for all x £ jC(U, V ) and y £ C(V, W ). Then a = b. 

Recall that the center of the group GL(V) consists of scalar transformations, for any V, 
see the remark after Proposition 4.8. 

Proposition 4.22 Let X, Y and Z be vector spaces, U = C(X,Y), V = C{Y,Z) and 
W = jC(X,Z), and let A £ GL{U), B £ GL{V) and C £ GL(W ) be transformations of U, 
V and W such that 


(Bv)(Au) = C(vu) V u £ U, v £ V. 


(5) 
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Then there exist p G GL(X), q G GL{Y), and, r G GL(Z) such that A, B and C are given 
by rules Au = qup -1 , Bv = rvq -1 , and Cw = rwp -1 , for u G U, v G V and w G W, 
respectively. 

Proof. First prove that A and B preserve type of subspaces of U and V, respectively. Clearly, 
(B(N))(A(M)) = C(NM) for any subspaces M C U and N C V, whence dim B(N)A(M) = 
dim NM, for all M and N. In particular, dimVA(M) = dim VM for all M. Apply¬ 
ing statement 1) of Lemma 4.20 we see that /3y(A(M)) dim Z = /3y(M)dimZ, whence 
Pi(A(M)) = Pi(M) for any M. 

Also, B(N)A(M) = 0 if and only if NM = 0, whence one easily sees that ann (A(M)) = 
_B(ann(M)), whence 

(dim Y — f3 2 (A(M))) dim Z = (dim Y — /3 2 ( M )) dim Z, 

whence finally fd 2 (A(M)) = j3 2 (M). Thus, A preserves type of subspaces in U. Similarly one 
can prove that B preserves type of subspaces in V. So by Proposition 4.15 A and B have the 
form Au = qup -1 and Bv = rpvqf 1 for some p G GL(X), q,qy G GL(Y) and rq G GL{Z). 
Substituting these explicit formulae into condition (5), we see that 

(ryvqf 1 )(qup -1 ) = C[yu ) V u G U, v G V. 

For any d G GL(Y ) we have (ud)(d~ 1 u) = vu, for all u G U and v G V, whence 

(ri vdqf 1 )(qd~ 1 up~ 1 ) = C(vd■ d~ x u) = C(yu) = (ry vqf 1 )(qup~ 1 ). 

Multiplying by r^ 1 on the left and by p on the right, we see that vdq^qd^u = vqf 1 qu, 
for all u G U and v G V. Applying Lemma 4.21 we see that dqf l qd~ l = qf l q. Thus, the 
element qf l q G GL(Y ) commutes with all d G GL{Y ) and so must be a scalar, whence 
qy = A q, for some A G K*. So we can rewrite the formula for Bv as Bv = ryvqf 1 = rvq -1 , 
where r = A _1 r 1 . Thus, we have Au = qup -1 and Bv = rvq -1 , for all u and v. Now 

(rvq -1 )(qup -1 ) = C{vu), V u G U, v G V, 

whence C{vu ) = rvup -1 for all u and v. Since (vu \ u G U, ,v G V) — W, we see that 
Cw = rwp -1 for all w G IT. □ 

The following statement may be considered by the reader as an evident consequence of 
Proposition 4.22. Nevertheless, we give a formal proof. 

Corollary 4.23 Let Ny = M nm , N 2 = M pn , N 3 = M pm , and suppose that transformations 
Ai G GL(Ni), i = 1,2,3, satisfy relations (A 2 x 2 )(AyX\) = A 3 (x 2 xy) for all xy G Ny and 
x 2 G N 2 . Then there exist elements s G GL m (K), q G GL n (K), and r G GL P (K ) such that 
Ayx i = qxys -1 , A 2 x 2 = rx 2 q -1 , and A 3 x 3 = rx 3 s -1 , for all x t G N l; i = 1, 2, 3. 

Proof. Let X = K m , Y = K n , and Z = K p be the column spaces. First we identify, in a 
usual way, Ny, N 2 , and N 3 with £( X, Y), C{Y, Z), and £( X, Z), respectively. 

Namely, for h G Ny let ip(h ) G C(X,Y) be the multiplication by h, i.e., ((p(h))(x) = hx, 
for all x G X. Equivalently, for a map g G C(X,Y) the corresponding element tp -l (g) G Ny 
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is just the matrix of g with respect to standard bases in X and Y. It is clear that p : N\ —> 
jC(X, Y ) is an isomorphism of linear spaces. 

In a similar way we identify N 2 with £(Y,Z), and IV 3 with £(X,Z). Moreover, we 
identify GL(X), GL(Y ) and GL(Z) with GL m (K ), GL n (K ) and GL p (K), respectively. It 
is convenient to use the same symbol ip for all these identifications. Thus, ip is a bijection 
from 

/ = N x U N 2 U N 3 U GL m (K) U GL n (K ) U GL p (K) 
to 

J = £(X, Y) U £(Y, Z ) U £(X, Z) U GL(X) U GL(Y) U GL(Z). 

Further, it is easy to see that p preserves multiplication, for example, p{x 2 )p{xi) = 
(p(x 2 Xi) for any x\ G N\ and x 2 G N 2 . Generally, if x, y G I, and if at least one of two 
expressions xy and p(x)p{y) is defined, then the other one is defined also and p(x)p(y) = 
pixy). Moreover, if x, y G J and at least one of two expressions xy and p~ l {x)p~ l iy) is 
defined, then the other one is defined also and p~ 1 (x)p~ 1 (y) = p~ l {xy). 

Let B 1 , B 2 and £>3 be the transformations of the spaces £{X, y), £{Y, Z), and £(X, Z), 
corresponding to Ai, A 2 and A 3 under ip\ that is, Bi = pAip^ 1 . Then for any y\ G £(X, Y) 
and y 2 G £(Y, Z) we have (B 2 y 2 )(Biy 1 ) = B 3 (y 2 y 1 ). Indeed, 

(B 2 y 2 )(B l y l ) = ((pA^-^y^^pA^-^yx) = (p(A 2 (p- l (y 2 ))))(p(A 1 (p~ 1 (y l )))) 

= v(M(p~ l {y 2 yi))) = {pA 3 p- 1 )(y 2 y 1 ) = B 3 (y 2 yi). 

Applying Proposition 4.22, we see that there exist si G GL(X), (g G GL{Y ) and r 1 G GL(Z) 
such that Bpx = qixsf 1 , B 2 x = rxxqf 1 , and B 3 x = rixsf 1 , where x G £(X, Y), £(Y, Z), or 
£(X, Z), respectively. Therefore for any x G Ni we have 

A x x = (p~ 1 B 1 p)(x) = p~ 1 (B 1 (p(x))) = p~ 1 (q 1 p(x)sf 1 ) 

= p~ 1 (q 1 )p~ 1 (p(x))p~ 1 (sx 1 ) = qxs -1 , 

where s = and q = p~ 1 (qi). In a similar way one can prove formulae for A 2 x and 

A 3 x (with r = 93 _ 1 (r 1 )). □ 


4.9 Proof of Proposition 4.8 

We start with the following observation. Let x and y be a x b and b x a matrices, respectively. 
Then Tr (xy) = Tr (yx). Moreover, 

(x,y) (x,y) = Tr (xy) = Tr (yx) 

is a nondegenerate bilinear pairing between M a b and Mb a - Therefore we may identify M* b 
with Mba , and M ba with M ab . 

Further, the group G = GL a (K) x GLb(K) acts on both M a b and M ba in a usual way, 
that is, g = (<71,172) takes x G M a b and y G Mb a to gixg^ 1 and g 2 ygf 1 , respectively. The 
pairing is invariant under this action. Indeed, if x G M a b, y G Mb a: and g = (< 71 , 02 ) G G, 
then 

(gx,gy) = Tr(( gixg^faygT x )) = Tr (g&ygi 1 ) = Tr (xy) = (x,y). 



V.P.Burichenko, Symmetries of matrix multiplication algorithms. I 


31 


Therefore the transformations, induced by g on M a b and M^a, are contragradient each to the 
other. 

Let L\ = M mn , L 2 = M np and L 3 = M prn be as in the hypothesis of the Proposition, and 
let Ni = M nm and N 2 = M pn . Then N z is dual to Lj, i = 1,2. Let ip : N 3 x N 2 —» L 3 be 
the usual product map, that is, tp(x,y) = yx. Its structure tensor fp G N* ® N£ ® L 3 may 
be considered as an element of L 3 ® L 2 ® L 3 . We show that <p — t — (m, n,p). 

Indeed, we have 

t ^ ^ C,j ® ^jk ® efci . 

l<i<m, 1 <j<n, l<k<p 

Let 

i >; l x ® l 2 ® l 3 = n* <g> n; ®L 3 —> £ 2 (Ni, n 2 - L 3 ) 

be the canonical map, described in Subsection 4.4 (denoted by p there). We must show that 
the bilinear map p = ip(t) coincides with ip. The bases of N 3 and N 2 are {ei q | 1 < l < 
n, 1 < q < m} and {e rs \ 1 < r < p, 1 < s < n}, respectively. It follows from the definition 
of ^ that the value of p on the pair ( ei q , e rs ) equals 

^ ^ Tr ((' jj Ciq ) Tr ( CjfaC r s ) C /. ^ ^ 4 //&iq 11 hrski ^ ^ $jl&js^rq ^Is^'rq • 

1 <i<m 1 <i<m 1 <j <n 

1 < 7 <n 

l<fc<p 1 <k<p 

On the other hand, (p(e iq , e rs ) = e rs e iq = 5 s ie rq . Thus, p(ei q , e rs ) = p(e iq , e rs ) for any /, q, r, 
and s, that is, tp = p. Thus, t = fp. 

Return to the proof of the proposition, and assume that g G T°(f). We have g = A\® 
A 2 ®A 3 , for some A * G GL(Li), i — 1, 2, 3. For i = 1, 2 we put Bi = G GL(L*) = GL(Ni). 
Then Aj = R t v , i = 1, 2. So we have <g) B\ <g) A 3 G T°(^). Now Proposition 4.14 implies 
that (Bi,B 2 ,A 3 ) G A(<^). In other words, (B 2 y)(B 3 x) = A 3 (yx) for any x G M nm and 
y G M pn . By Corollary 4.23, there exist a G GL m (K ), b G GL n {K) and c G GL P (K) such 
that Ri, i ?2 and A 3 are defined by the rules B 3 x = bxa -1 , = cxfe -1 and A 3 x = crra -1 , 

where i G Aj, JV 2 , or f 3 , respectively. 

It follows from the discussion in the beginning of the proof that the transformation on 
Li, contragredient to transformation x hg bxa -1 on N \, may be described by the formula 
x i— y axb~ 1 . Similarly, A 2 acts by the rule x (->• bxc~ 1 . Therefore, g acts by 

A(x ® y ® z) = axb _1 ® bye -1 ( 8 ) cza~ l . 


That is, g = T(a,b,c). □ 

5 Automorphisms of Laderman algorithm 

In this section we find automorphisms of the Laderman algorithm. The structure of the 
section is as follows. First we recall Laderman algorithm in its computational form, and 
rewrite it in the tensor form. Then we produce a certain subgroup G of T(f), the isotropy 
group of t = (3, 3, 3). Then we check that G preserves the algorithm. Finally, we prove that 
G is the full automorphism group of the algorithm. In the end of the section we give a less 
formal explanation on how G was found. 
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5.1 Laderman algorithm 

Recall the description of the Laderman algorithm in computational form, according to [ 33 ]. 
Let 

(x 11 x 12 x 13 \ A/ll 2/12 yi 3 \ 

X = x 2 i x 22 x 23 and Y = y 21 y 22 y 23 . 

\x 3 l x 32 X 33 ) \?/31 2/32 2/33 / 

Consider the products 

Pi = Ani + xi 2 + xi 3 - x 21 - x 22 - x 32 - x 33 )y 22 , p 2 = (xn - x 21 )(-y 12 + 2/22), 

P 3 = x 22 (—yn + 2/12 + 2/21 — 2/22 — 2/23 — 2 / 3 i + 2/33), Pa = (—£11 + X21 + 2^2) (2/11 — 2/12 + 2/22), 
P 5 = (a?2i + ^22)(-2/11 + 2/12), Pd = xuyn , P 7 = (—a?ii + x 3 i + x 32 )(y u - 2/13 + 2/23), 
p 8 = (-ahi + £31X2/13 - 2/23), P 9 = (£31 + x 32 )( 2/11 + 2/13), 

P10 = (X11+X12+X13—X22—X23—a; 3 i — ^2:32)2/23 , £>11 = 3 ^ 32 (—2/11+2/13+2/21 —2/22—2/23— 2 / 3 i +2/32), 

P12 = (-£13 + 2:32 + X33) (2/22 + 2/31 - 2/32), P13 = (^13 - 2:33) (2/22 - 2/32), Pl 4 = 21132/31 , 

Pl5 = (2132 + 2:33) (- 2/31 + 2/32), Pld = (-2113 + 2122 + X 23 )(y 23 + 2/31 - 2/33), 

Pl 7 = (2113 - 2123 ) ( 2/23 - 2 / 33 ), Pl 8 = (2122 + 21 23 ) (- 2/31 + 2 / 33 ), Pl 9 = 21 i 2 2/21 , 

£>20 = 21232/32 , £>21 = 21212/13 , £>22 = 21 3 i 2 /l 2 , £>23 = 21332/33 • 

Then one can check that the coefficients of the matrix 

/ Z11 ^12 ^13 
Z — XY = I 221 ^22 £23 
\z 3 i Z 32 £33 

can be computed according to the formulae 

Z 11 = £>6 + £>14 + £>19 , 

312 = Pi + Pa + £>5 + Pe + P12 + £>14 + £>15 , 

213 = £>6 + £>7 + £>9 + £>10 + £>14 + £>16 + £>18 , 

221 = £>2 + £>3 + £>4 + £>6 + £>14 + £>16 + £>17 , 

222 = P2 + Pa + £>5 + P 6 + P20 , 

223 — £>14 + £>16 + £>17 + £>18 + £>21 , 

231 — £>6 + £>7 + £>8 + £>11 + £>12 + £>13 + £>14 , 

232 = £>12 + £>13 + £>14 + £>15 + P22 , 

233 = Pd + Pi + P 8 + P 9 + P 23 ■ 

(In [ 33 ] there are only a few words on how this algorithm was found. The author of [ 33 ] 
promised to publish more detailed description of his approach, but this was never done.) 
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Write Laderman algorithm in tensor form. Let M = M^(K) (= M^K), in traditional 
notation, since we consider square matrices), put L\ = L 2 = L 3 = M, and put next L = 
L\ <g) L2 <8> L3. Consider the following elements of L : 

t\ = (en + ei2 + ei3 — e2i — e22 — £32 — £33) <g> e22 <£> £21, 

£2 = (fin — £21) <£> (—£12 + £22) <£> (ei2 + £22), 

£3 — ^22 ® ( — ^11 + £12 + £21 — e22 — 623 — £31 + £33) <E> £12 , 

L1 = (—£11 + £21 + 622) ® (fill — £12 + 622) ® (£21 + ei2 + e 22) , 

£5 = (£21 + £22) <£> ( — en + £12) < 8 > (£21 + £22) f 
^6 — e ll ® £11 <E> (fill + £21 + £31 + e 12 + ^22 + ^13 + £33) , 

£7 = ( — £n + e 3 i + 632) <E> (fin — £13 + 623) ® (£31 + £13 + 633), 

£s — ( — en + £31) ® (£13 — 623) £3 (£13 + £33 ), 

£9 — ( e 3 i + ^32) ® ( — eii + 613) <£) (£31 + £33) , 

£10 = ( e ll + e 12 + Ci3 — £22 — 623 — £31 — £32) <£> e 23 ® £31 ) 
tn = £32 <£> ( — en + ei3 + e2i — e22 — e 23 — e3i + 632) <S> £13 , 
tl2 = ( — £13 + e 32 + 633) <£> (£22 + 631 — £32) <8> (£21 + Ci3 + £23) ) 
tl3 — ( e 13 — 633) <8) (£22 — 632) <E> (fil3 + 623) ) 
tl 4 = ^13 < 8 > £31 ® (fill + £21 + £31 + £12 + £32 + ei 3 + 623) , 

1 15 = (e 32 + £33) ® ( £31 + 632) < 8 > (fi 21 + £23) , 
tl6 — ( —e 13 + e 22 + 623) ® (fi 23 + £31 — £33) <E> (£31 + Cl2 + 632) , 
tl 7 = (Cl 3 — 623) (fi 23 — ^33) <8) (fil2 + £32) , 

tl 8 — ( e 22 + ^23) ® ( — £31 + 633) ® ( e 31 + 632) , 
tl9 — 612 <£> £21 ® Cn , ^20 = e 23 ® £32 ® £22 , 

£21 = £21 <£> £13 632 , £22 = e 3 i ® £12 £23 5 

f 23 = e 33 ® £33 £33 • 

Proposition 5.1 The set C — {£1,... ,£23} is the tensor form of the Laderman algorithm. 

Proof. A direct computation following discussion in Section 2. Alternatively, the reader can 
check that the sum ti + ... + £23 coincides with t = ( 3 , 3 , 3 ). □ 
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5.2 A subgroup of V(t) 

For a,b,c G GL 3 (K ) let T(a,b,c ) : L —> L be the transformation, described in Subsec¬ 
tion 4.2, defined by 


T(a,b,c ) : x <g) y ® z h->- axb 1 <g) bye 1 <g) cza 

Introduce notation for several special elements of GL 3 (K). Let 

/0 1 0 \ 

7 Ti 2 = 1 0 0 = e 12 + e 2i + e 33 

\0 0 1 / 


be the transformation, intrehanging the basis vectors e 3 and e 2 , and define 7 t 13 and 7r 23 
similarly. Also put £i = diag(—1,1,1), £ 2 = diag(l, —1,1), and £ 3 = diag(l, 1, —1). Note 
that all matrices 7and £* are symmetric, and £ 8 and Xjk commute, if {i, j, k} = {1,2,3}. 

Further, introduce the following decomposable automorphisms <f>j, i = 1,2, 3,4, of L. 
Put 

$i = T(7T 23 , 7 Ti 3 , 1) : x ® 1/ <g> z H- n 23 X7T 13 <g) 7T 13 t/ <g) £7T 23 , 


and 

<h 2 = T( tt 23 , 1,7T 23 ). 


Next, define <h 3 and $4 by 


< h 3 (x <s> y <8> 2 ) = y t e 2 <E) £ 2 ^ <E) 2 * 

(where x H> a; 4 is the transpose map) and 

<F 4 (a; <g) y ® z) = £i^7Ti 2 <g) 7Ti 2 a;7ri 2 £i <g) £i7Ti 2 7/£i . 

Proposition 5.2 The following relations hold: 

${ = $2 = $2 = $3 = = $ 2 $ 1 ; $ 3 < 1 > 1 < I > 3 = < F 1 $ 2 , 

< h 3 < h 2 < h 3 = $2 , $4 $i$ 4 1 = $1$2 , $4$ 2 $ 4 1 = $1 , 

$ 3 $ 4 $ 3 = < h 4 1 . 

Proof. The relations ${ = $2 = 1 and < hi < h 2 = < h 2 < f > i hold because 7r 13 and 7r 23 are elements 
of order 2 in GL 3 (K) and (a, b, c) 1 —> T(a,b,c ) is a homomorphism from GL 3 (K) X 3 to T(t), 
as was observed in the proof of Proposition 4.5. The remaining relations may be proved by 
a direct calculation. Show, for example, that <h 4 = 1 and < f> 3 < hi < l ) 3 = < hi < h 2 . 

We have 

$4 

x®y®z 1-4 £iZ7Ti2 <8> 7r 12 x7r 12 £i <8> £i7r 12 y£ 1 

& £l(£l7Tl2y£l)7r 12 ® 7ri 2 (£i^7Ti 2 )7ri 2 £i <g) £i7Ti 2 (7ri 2 X7ri 2 £i)£i. 

Simplify the latter expression. We have 

£i(£Wi2Z/£i)tP2 = — 'K\ 2 y£\'K\ 2 , 
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'KlliyiZ'Kyil'KYzEi — X\2£\Zx\ 2 £\ — X\2£\Z£\ , 

and similarly 

^l 7r 12( 7r 12^7ri2^l)^l = £\XTli2 , 

whence 

®y®z) = xi 2 y£iXi 2 ® n^izSi ® £ixtti 2 . 

Consequently, 

$ 4 ( 0 ; ®y®z) = £1 (£ 1 X 7142)^12 < 8 > x X 2 i yX\2y£\X\2)'X\2£\ ® £i7Ti2(7ri2£i^£i)£i = X<g)y®z, 

so the equality $4 = 1 is established. 

Now we check that < h 3 < l ) i < h3 = $i$2- We have 

X®y®Z e-f y t £2® £2X t ® z l 'K2‘z(y t £2)'Kl3®'Ki3£2X t ® Z t Tl23 

^ i^lz£2X t ) t £2 ® £ 2 (^ 23 (^£ 2 )^ 13 )* ® (^^ 23 )*- 
Simplify the latter expression. We have 


(7ri 3 £ 2 ^) t £2 = (x t ) t £|7T4 3 £2 = ^2^13^2 = 2^13 

(because 7 t 13 and e 2 are symmetric and commute). Similarly ^(^(z/^Vis)* — ^2^\z£2y^23 = 
7Ti3y7r23, and (^^23)* = 7^32. After all we obtain 

($ 3 $i$ 3 )(x ®y®z) = xtt 13 ® Ti 13 yTT 2 3 <g) X 23 Z = T( 1, 7Ti 3 , 7r 23 )(x ® y <8> z), 

whence < f> 3 <f> 1 $ 3 = T(l,7 t 13 ,7t 23 ). It remains to observe that 


T(l, 7 Ti 3 , 7T 23 ) = T(7r 23 , 7T 13 , 1)T(tT23, 1, 7T23) = &1&2- 


□ 


Lemma 5.3 Let G be a group containing four elements a i, 02, a 3 , 04 such that 

( 1 ) ai, 02, a 3 , 04 generate G, 

( 2 ) ai satisfy relations 

O4 = Q<2 — 0 3 — Q4 — 1, O1O2 = 02®1 , 0 3 0l0 3 — Q1O2 , (6) 

a 3 a 2 a 3 = ®2 , Q4Q1Q4 ' = 04O2 , 0402^4 — Oi, a 3 a 4 a 3 = 04 ', 

and 

( 3 ) ai 7^ 1 . 

Then G ^ S 4 . 

Proof. First observe that S4 contains elements 61, 62, hi , 64, satisfying relations (6) (with a., 
replaced by /y), namely 61 = (12) (34), b 2 = (13) (24), 6 3 = (13), and b 4 = (123) (checking 
relations (6) is left to the reader). Also note that the system of relations (6) is equivalent to 
the following system of relations: 

2_ 2_ 2_ 3_1 _ _ 

O4 — o 2 — o 3 — Q4 — r, 02G1 — 01O2 ; 03O1 — a±a 2 a 3 , 


(7) 
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_ _ _ _ 2 
O3O2 — O2®3 ; O4Q4 — 0.1(12(14 , O4O2 — Q4CI4 , CI3O4 — O4O3 . 

It follows that if G is any group, satisfying conditions ( 1 ) and ( 2 ) of the lemma (but not 
necessary ( 3 )), then any element of G can be written in the form 

a l f a l f a l f ci'f , (8) 

where 0 < I1J2J3 < 1 and 0 < l 4 < 2. Indeed, any element of G can be represented as a 
product of elements a*, for example 

Cl2®3 ^4 = 02030 4 0 4 0\ • 

Next, using relations a\ = 1 , 0304 = 040203, 0302 = 0203, and 0304 = 040.3, we can transform 
such a product to the form M3, where to is a word involving only 0.4, 02, and 04, and 
Z3 G { 0 , 1 }. For example, 

O2O3O4O4O1 = O2O4O4O3O4O1 = O2O4Q.4O4O4O3Q4 = O2O4O4O4O4O1O2O3 = O2O4Q4O2O3 

(in this chain of transformations we underline, in each step, the piece of product being 
transformed in this step). Similarly, using relations 040.4 = 0.40204 and 0.402 = 0404, we can 
drag all 04 involved in w to the right and obtain an expression of the form va l f, where v is 
a word involving only 0.4 and 02, and 0 < I4 < 2 . Finally we can transform v to the form 
a) 1 o.2, where 0 < Z4, Z2 < 1 , using relations a\ — a\ = 1 and 0204 = 0402- After all these 
transformations we arrive to the word of the form (8). 

Therefore, any group G that satisfies conditions ( 1 ) and (2) is of order < 2 • 2 • 3 • 2 = 24 ; 
in particular, G is finite. 

It is well known that the normal subgroups of the symmetric group S4 are the following: 
the trivial group {e}, the Klein four-group 

V = {e, ( 12 )( 34 ), ( 13 )( 24 ), ( 14 )( 23 )}, 

the alternating group A4, and the full S4. In particular, any nontrivial normal subgroup of 
S 4 contains V. Observe also that (64,62) = V, b 4 G A 4 — V, and 63 G S 4 — A 4 . Hence it is 
easy to see that (64, b 2 , 64) = A 4 and (64, 6 2 , 63, 64) = S 4 . 

Consider the direct product H = G x S 4 , and let Xi : H —* G and 7 t 2 : H —» S 4 be the 
projections onto factors. Consider next the elements c l = {a^bf) G H,i — 1 , 2 , 3 , 4 , and put 
K = (c4,c 2 ,C3,c 4 ). Then 

tti(A ) = (7^(04),..., x\{c 4 )) = {a\, 02,03, a 4 ) = G, 

and similarly 7 t 2 (K) = S 4 . Since the elements a*, as well as 6 *, satisfy relations ( 6 ), the 
elements c* satisfy these relations also, whence \K\ < 24 . As 7t 2 |k '■ K —> S 4 is surjective, 
\K\ < 24 , and | ^4 1 = 24 , we see that 7T2 \k must be an isomorphism. 

Let p = (7T 2 1 /\) —1 : S 4 —* K be the isomorphism, inverse to 7 T 2 1 jv- Then o = 774 \k 0 p '■ 
S 4 —> G is an epimorphism. Now it is sufficient to show that the kernel of o is trivial. 
It follows from the definitions that 7T 2 (cj) = 6*, whence p(6j) = c t and cr(6j) = 7Ti (p(6,)) = 
774 (d) = a l . If Kercr ^ 1 , then Kercr 3 V 3 64, because V is the only minimal normal 
subgroup of S4. But <7(64) = (i\ ^ 1 , a contradiction. Hence Kercr = 1 . □ 

Proposition 5.4 The group G = ($4, $2, $3, $4) is isomorphic to S 4 . 


Proof. It follows from Proposition 5.2 that the elements = a t satisfy all conditions of 
Lemma 5 . 3 . □ 
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5.3 Invariance of C under G 

In this subsection we prove that the Laderman algorithm £ is invariant under the group 
G = (<hj | i = 1,2, 3 , 4 ). To prove this, we need the following quite general statement. 

Lemma 5.5 Let G be a group acting on a set X, and N < G be a normal subgroup. Let 
O be an orbit of N on X. Then for any g G G the set gO = {gm \ m G Oj is also an 
N-orbit. In particular, if 0 \ and O2 are two N-orbits, x t G Oj, and g G G an element such 
that gx 1 = X2, then g bijectively maps 0 \ onto (P 2 . 

This statement is well known, and we leave its proof to the reader (for a proof it suffices 
to note that g(nx) = (gng~ 1 )(gx) for any x G A", n G N, and g G G\ so, if x and y are in 
the same IV-orbit, then gx and gy are in the same IV-orbit also). 

We also need to know the images of some of the tensors £,... ,t 2 3 under some of the 
transformations <hj. 

Lemma 5.6 The following relations hold: 

$1 : t\ 1 y ti, £ 2 ^ £i 3 > £3 l— £ In, £4 i—> £ 12 , £5 i—> £ 15 , £ 6 t-G £14, 

£7 l— £ £16, tg 1 £ tn, £91 £ t is, £10£ £ 10, £19 l— £ £19, £20 l— £ £22, 

£21 l— £ £23 : 

$2 : £i G £10, £2 l— £ £3, £3 l— £ £11, £4 l— £ £7) £5 l— £ £9) ^6 l— £ 

£19 l— £ £19, £20 l— £ £23 ; 

$3 : £6 l— £ te, £ 2 i-G £5, £4 1—£ £4, £ 19 1 —> 1 19, £ 2 3 1—£ £ 23 ; 

$4 : £1 1—£ £3 1—£ £ 6 , £ 2 1 y £ 2 , £4 1 ^ £4, £5 1 £ £5, £19 1—£ £ 19 , £ 23 1 —y £ 23 . 

Proof. A direct computation. Prove, for example, that <E>i(£ 3 ) = £n, = £3, and 

*^*3 (£2) = £5- 

We have £ 3 = e 22 <g) (—en + e i2 + e 2i - e 22 - e 23 - e 3 i + e 33 ) <g) e i2 . The transformation 
$1 acts by $1(2; ® y <g) z) = 77 23 a; 7 ri 3 <g) 7 Ti 3 y <g) Z 77 23 . Hence 

4 *i(£3) = 7723622^13 ® tti 3 (—en + ei 2 + e 2 i — e 22 — e 23 — e 3 i + e 33 ) ® ei 2 7 r 23 . 

Simplify multiplicands in the latter product. We have ei 2 7 r 23 = ei 2 (en + e 23 + e 32 ) = ei 2 e 23 = 
ei 3 . Generally, for any i = 1 , 2,3 we have e,;i 7 r 23 = e a , e i 2 77 23 = e i3 , and e i 3 7 r 23 = e i2 . That 
is, the multiplication of e t] by 7 T 23 on the right affects only the index j, by the rule 1 G 1 , 
2 G 3 . Similarly we can find any product of the form ne^Tr', where 77,77' G { 77 i 2 , 7743, 77 23 }. 
In particular, 77 23 e 22 77 i 3 = e 32 . 

Next, 77 i 3 (—en + e 12 + e 21 - e 22 - e 23 - e 3 i + e 33 ) = -e 3 i + e 32 + e 2i - e 22 - e 23 - e n + ei 3 . 
Thus we obtain 

®i(t 3 ) = e 32 ® (—e 3 i + e 32 + e 2 i — e 22 — e 23 — en + ei 3 ) ® ei 3 , 


11 ■ 


which coincide with £ 
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Similarly we have £1 = (en + e 12 + ei 3 — e 2 i — e 2 2-e 32 — e 33 )<8)e 2 2<8)e 2 i and = 

£i^7Ti2 <8> vri 2 a;7ri2£i <8> £i 7 ri 2 y£i, whence 

$4(^1) = £ie 2 ivri2 <£> 7 ri 2 (en + ei 2 + ei3 — e 2 i — e 22 — e3 2 — <£> £i^ne 22 £i ■ 

Simplify. For any i and j we have £\e\j = —e\j and £\e tJ = eij if i — 2 or 3 ; similarly 
e t \£\ = —Cji and e t j£i = e tJ if j = 2 , 3 . Hence 

£ie 2 ivri2 = e 2 i7Ti2 = e 22 , £i7ri 2 e 2 2£i = £iX\2^22 = £iei 2 = — ei 2 , 


and 

7Ti 2 (eii + ei 2 + ei3 — e 2 i — e 22 — e3 2 — e33)7ri 2 £i = (e 22 + e 2 i + e 2 3 — ei 2 — en — 631 — 633)61 

= e 22 — e 2 i + e 2 3 — ei 2 + en + e3i — 633 . 


Thus we obtain 

$4(^1) = 6 22 <g) (e 22 — e 2 i + e 2 3 — ei 2 + en + 631 — 633) <g) (—en), 
which is equal to 1 3 . 

Finally, t 2 = (en - e 2 i) <g) (—ei 2 + e 22 ) <g) (ei 2 + e 22 ) and <f> 3 (a; ®y®z) = y t e 2 < 8 > £ 2 x t < 8 > 
whence 

3*3 (£2) = ( — 612 + e 22 ) t 6 2 <8) £ 2 (en — e 2 i)* <g) (ei 2 + e 22 ) f = (—e 2 i + e 22 )£ 2 <g) e 2 (en — ei 2 ) <E> (e 2 i + e 22 ) 
— ( — 621 — e 22 ) <8) (en — 6 i 2 ) <8) (e 2 i + e 22 ) = (e 2 i + e 22 ) <g> (—en + 612) <£> (e 2 i + e 22 ) 

= £5 . 

□ 


Proposition 5.7 The set £ is invariant under the group G = ($1, $2, $3, $4). 

Proof. We will consider the action of G on the set of all decomposable tensors in L = 

L\ <S) L 2 $5 £ 3 . 

First show that £ is invariant under 4 >i, and break it into the orbits under the cyclic 
group ($1)2- By Lemma 5.6 we have <Li(£ 2 ) = £.3. As = 1 , we have also $i(ti 3 ) = 
< hi(*hi(£2)) = t 2 , so {£ 2 , 7 13} is a ($i) 2 -orbit. We abbreviate {£2,^13} to { 2 , 13 }. Similarly it 
follows from Lemma 5.6 that 


uq — { 1 }, ui 2 — ( 2 , 13 }, iv 3 — { 3 , 11 }, uq — ( 4 , 12 }, 


u; 5 = { 5 , 15 }, ujq = ( 6 , 14 }, ca 7 = ( 7 , 16 }, ca 8 = {8, 17 }, 

^9 = { 9 , 18 }, cuio = { 10 }, can = { 19 }, cai 2 = { 20 , 22 }, 
and CU13 = { 21 , 23 }, 

are ($i) 2 -orbits. In particular, £ is ($i) 2 -invariant. 

Next consider the subgroup H\ = ( < 3 >i,$ 2 )- As $1 and $2 commute, ($1)2 is normal 
in H\. As $2(£2) = ts and <h 2 — 1 , we have & 2 (ts) = t 2 . Now it follows from Lemma 5 . 5 , 
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applied to ($i) 2 <! Hi, that $2 bijectivcly maps oj 2 = { 2 , 13 } and tag = { 8 , 17 } each onto 
the other. Therefore 

uj 2 U cc 8 = { 2 , 13 , 8 , 17 } = { 2 , 8 , 13 , 17 } 
is an //i-orbit. Similarly we see that the following sets are Hi- orbits: 

Hi = { 1 , 10 } = u>i U taio , H2 = { 2 , 8 , 13 , 17 } = uj 2 U tag , fig = { 3 , 11 } = U3 , 

H4 — { 4 , 7 , 12 , 16 } — W4 U 0J7 , H5 — { 5 , 9 , 15 , 18 } — ou 5 U cag , fig — { 6 , 14 } = tag , 

H7 = { 19 } = tan , fin = { 20 , 21 , 22 , 23 } = tai2 U tai3 . 

Further we consider subgroup H 2 = ($1, $2, $4). ft follows from relations 4>4 < Fi < h7 1 = 
$i<h 2 and $4 $2 $4 1 = $1 that $4 normalizes Hi, so II\ is normal in H 2 . Similarly, the 
relations = 1, < L 3 < I > i < 3>3 = < hi < l > 2, < h 3 < l > 2 < l > 3 = $2, and <L3<L4<I<3 = imply that $3 
normalizes H 2l so H 2 < G. 

(One can observe (though this is not necessary) that under the isomorphism between G 
and S4, described in the proofs of Lemma 5.3 and Proposition 5 . 4 , the subgroups Hi and 
H 2 correspond to normal subgroups V and A 4 of S4.) 

By Lemma 5.6 $4 permutes cyclically ti, t 3 , and fg. So by Lemma 5.5 $4 cyclically 
permutes the //i-orbits fli = { 1 , 10 }, fl 3 = { 3 , 11 }, and fig = { 6 , 14 }. Therefore the set 
Si = fli U H3 U fig = { 1 , 3 , 6 , 10 , 11 , 14 } is an //2-orbit. Next, as each of the tensors £, where 
i = 2 , 4 , 5 , 19 , 23 , is $4-invariant, we see that each of the H i-orbits O2, H4, fl.5, H7, and fig is 
$4-invariant, and therefore is an //2-orbit. 

As $3 interchanges t 2 and f 5 and normalizes H 2 , it interchanges the Z/ 2 -orbits fl 2 and fl 5 . 
Therefore S 2 = H2 U fl 5 is a G-orbit. Next, as $3 hxes elements t 6 G Si, f 4 G fl 4 , tig G H7, 
and t 2 g G fig, we see that $3 leaves invariant their H 2 -orbits Si, fl 4 , O7, and fig. Thus 
we obtain that the sets Si = {1, 3 , 6 , 10 , 11 , 14 }, S 2 = O2 U O5 = {2, 5 , 8, 9 , 13 , 15 , 17 , 18 }, 
fl 4 = { 4 , 7 , 12 , 16 }, fl 7 = { 19 }, and fig = { 20 , 21 , 22 , 23 } are G-orbits. So their union 
£ = Si U S 2 U fl 4 U fl 7 U fig is invariant under G. □ 

5.4 The full group Aut(£) 

According to Proposition 5 . 7 , the group G = (< 3 >j | i — 1 , 2 , 3 , 4 ) is a subgroup of Aut(£). In 
this subsection we prove that G is the full automorphism group of £. 

Let Aut(£)o < Aut(£) be the subgroup of all elements that correspond to the identity 
permutation of {Li, L 2 , L 3 }. Also put Q 1 = ( < h 3 ,<l ) 4). 

Lemma 5.8 Aut(£) = Aut(£)gQi • 

Proof. Let n :T(t) —> S3 be the homomorphism taking each g e F(t) to the corresponding 
permutation of {Li, L 2 , L 3 }. Then ^($4) = ( 123 ) and ^($3) = ( 12 )( 3 ), whence tt(Qi) = 
(( 123 ), ( 12 )( 3 )) = S3. Therefore for each element g G Aut(£) there exists an element g' G Q\ 
such that 71(g) = n(g'). Put g" = g(g')~ 1 , then g = g"g'. Also x(g") = 7 i(g) 7 i(g') _1 = e, and 
therefore g" G Aut(£) 0 . So g G Aut(£) 0 Qi- As g was an arbitrary element of Aut(£), we 
obtain that Aut(£) = Aut(£)o< 3 i. □ 

It follows from Proposition 4.8 that the elements of Aut(£)o are precisely the elements 
of Aut(£) of the form g = T(a, b, c), for some a, b, c G GL 3 (K). 

Introduce a notion which will play important role in the rest of the paper. 
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Tensor projections. Let U ®V be the product of two spaces. For any two subspaces 
X , L C [/we have A® FriT ® V = (A^fl Y') ® V. It follows that for any subspace L C U®V 
there exists the least (i.e., the unique minimal) subspace X C U such that L C X ® V. We 
call X the tensor projection of L to U, and denote this by A" = tpr v L. 

The tensor projection tpiyL to the second factor is defined similarly. Also, for an element 
x G U ® V we write tpr^x for tpr (7 (a;). 

Generally, if U — U\ ® ... ® U m is the product of several spaces, and U — ® ... ® Ui v 
where 1 < i\ < ... < ii < m, is a “subproduct”, then we can dehne tpr^L, the tensor 
projection to U, for any subspace LC[/. 

It is more or less clear that operation of taking tensor projections has transitivity prop¬ 
erty; for example, 

tp% 2 (tpr um 1 ) = tP r u 2 L 

for any L C Ui ® U2 ® U3. 

Let t/ ® V he the product of two spaces, and X C U ® V be a set of nonzero elements 
of U ® V. Dehne 

tpiy/A = {tpr^a: | x G A}. ( 9 ) 

This is a set of nonzero subspaces of U. It may happen that tpr v x = tpr v y for two distinct 
x, y G X, x ^ y. So we shall consider t.pr c/ A as a multiset , i.e., a set with multiplicities (at 
least in the case if X is finite). Note that we can dehne tpr V X by formula ( 9 ) for a multiset 
X also. 

The operation of taking tensor projections has certain invariance properties. The follow¬ 
ing lemma is evident, but its accurate proof (which is left to the reader) may be tedious. 

Lemma 5.9 Let U — U\ ® ... ® U m be a tensor product of several spaces, X C U be a finite 
(multi)subset of nonzero tensors, and let tp G S(Ui,, U m ) be a decomposable automorphism 
ofU such that (p(X) = X. Suppose that takes factor Ui to Uj, and let if : U x —> Uj be the 
corresponding isomorphism (if is defined up to constant). Then if takes (multi)set tpr v .X 
to tpr^A. 

Return to considering group Aut(£). We need the following lemma. 

Lemma 5.10 Let V = (ei, e2,ef) be a three-dimensional space, and let 7t 2 3 G GL(V ) acts 
by vr 2 3 : e\ eG e.\, e 2 GG e 3 . Let G GL(V) be a transformation preserving the multiset of 
lines 

A = {1 • (ei), 4 • (e 2 ), 4 • (e 3 ), 2 • (ei — e 2 ), 2 • (e\ — e 3 )}. 

Then either p> = A • idy or (p = A7t 23 , where A G K*. 

Proof. It is clear that tp preserves each of the three sets of lines {(ei)}, {(e 2 ),(e 3 )}, and 
{(ei — e 2 ), (ei — 63)}. So p> either leaves each of the lines (ef) invariant, or preserves (ei) and 
interchanges (e 2 ) and (e 3 ). 

In the hrst case we have ^{ef) = diCi, where a* G K*. Then p>[e\ — e 2 ) = aiei — a 2 e 2 . 
The latter vector must be proportional to e\ — e 2 or to e\ — e 3 , so it is proportional to e\ — e 2 
and ai = a- 2 . Similarly eq = a 3 , and therefore <p — a\- idy. 

In the second case tp acts as e\ e-> aiei, e 2 hg a 2 e 3 , e 3 hg a 3 e 2 . The line (ei — e 2 ) goes to 
(aiei — a 2 es), whence a 2 = a\. Similarly a 3 = and therefore p> = aivr 23 . □ 
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Lemma 5.11 Consider the following set of one-dimensional subspaces in the space M = 

C = { (en — 621), (e2i + 6-22), ( — Cn + e3i), (e%\ + e32), (ei3 — 633), (e 32 + 633), 

( e 13 — e 23), ( e 22 + e 2 3), (ei 2 ), (^23)) ( e 2l), ( e 3l); ( e 33)}- 

Lei a, b G GL 3 (K ) 6 e elements such that the transformation (p : x (->• axb preserves C. Then 

a G {1, 7 r 23 } and be{l,n 13 } 


up to scalar factors. 


Proof. Let V and V' be spaces of 3-columns and 3-rows, respectively. We can identify 
M = M 3 (K) with VfgiV 7 by the isomorphism a : VfgiV 7 —> M 3 (K) defined by a{y®v') = W 
(cf. Subsection 4.1). 

Note that a matrix x G M is of rank 1 if and only if the corresponding tensor a ® 1 (a;) G 
V ® V 7 is decomposable, a~ 1 (a;) = v ® v' . The set C = a _ 1 (C), corresponding to C under a , 
consists of 13 lines in V ®V' spanned by decomposable tensors, namely 

C = a -1 (C) = { ((ei - e 2 ) <8 e 1 ), (e 2 (e 1 + e 2 )), ((d - e 3 ) ® e 1 ), 

(e 3 ® (e 1 + e 2 )), ((ei - e 3 ) ® e 3 ), (e 3 ® (e 2 + e 3 )), ((d - e 2 ) ® e 3 ), 

(e 2 ® (e 2 + e 3 )), (ei ® e 2 ), (e 2 ® e 3 ), (e 2 <8> e 1 ), (e 3 ® e 1 ), (e 3 ® e 3 )}. 

Let ip' \ V ®V' — » V ® W be the automorphism corresponding to p under a, that 
is, ip' = a~ x pa. It is easy to see that ip' is decomposable, namely p>' = p>\ ® p 2 , where 
pi : V —> V and P2 '■ V' —* V' are defined by pi{x) = ax and P2(y) = yb. Indeed, for any 
v G V and v' G V' we have 


p\v®v') = ( a 1 pa){v ® v') — a 1 (p(a(v ® v'))) — a 1 (p{vv')) = a 1 (avv'b ) 

= a _1 ((au)(t/6)) = av ® v'b = </?i(v) ® ^(n') = {pi ® </? 2 )(v <E> n 7 )- 

Since C is invariant under p, we see that C' must be invariant under p'. Therefore the 
tensor projections tpr V C = C\ and tpr V ,C = C' 2 must be invariant (as multisets) under pi 
and p 2 , respectively. 

We can immediately see that 

C[ = {1 • (ei),4 • (e 2 ), 4 • (e 3 ),2 ■ (ei - e 2 ),2 • (ei - e 3 )} 


and 

C' = {4 • (e 1 ), 1 • (e 2 ), 4 • (e 3 ), 2 • (e 1 + e 2 ), 2 • (e 2 + e 3 )}. 


It follows from Lemma 5.10 that a — 1 or a — n 2 3 , up to a scalar. Similarly one can prove 
that b = 1 or b = 7Ti 3 np to a scalar. □ 


Proposition 5.12 The equality Aut(£) 0 = ( < f > i, < f ) 2 ) holds. 
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Proof. The inclusion Aut(£)o 7> ($ 1 , <h 2 ) is obvious, because both $1 and $2 are of the form 
T(a,b,c). We need to prove the inverse inclusion. 

Let u = ui®u 2 ®u 3 be a decomposable tensor of L\®L 2 ®L 3 . The triple (rk(wi), rk(u 2 ), rk(u 3 )) 
will be called the type of u. The tensors of type (1,1,1) in £ are £ with i = 2, 5, 8 , 9, 13,15, 

17, 18, 19, 20, 21, 22, 23, of type (2,1,1) — £ with i = 1,10, (1,2,1) — with i = 3,11, 

(1,1, 2) — i = 6,14, and (2, 2, 2) — i = 4, 7, 12, 16 (and there are no tensors of other types, 
say (2,2,3), in £). 

By B we denote the set of all £ of type (1,1,1). 

Observe that for any x G GL 3 (K) and y G M 3 (K) we have rk (xy) = rk (yx) = rk (y). 
Hence for arbitrary decomposable tensor u = u\ <g) u 2 ® u 3 G L and arbitrary transformation 
of the form g = T(a, b, c ) the types of tensors u and g(u ) coincide, ft follows that any element 
g G Aut(£)o preserves B. 

Further, let g = T(a, b, c ) be any element of Aut(£)o- As g preserves B, the set of tensor 
projections C = tpr Ll H must be invariant under transformation ip : x 1 —>• axb~ 1 . It is easy to 
see that 

£ = { (eii — e 2 i), (e 2 i + e 22 ), ( — en + e 3 \), (e 3 \ + e 3 2 ), (ei 3 — 633 ), (e 3 2 + 633 ), 

( e i 3 — e 23 ), (e 22 + e 23 ), (ei 2 ), (e 23 ), (e 2 i), (e 3 i), (e 33 )}. 

By Lemma 5.11, g has the form g = T( 7 r| 3 , ^( 3 , c), for some e,g G {0,1}. Remembering 
that $1 = T(7r 23 ,7r 13 ,l) and <h 2 = T(vr 23 ,1, 7t 23 ), we see that there exist (uniquely defined) 

7 , 5 G {0,1} and g' of the form g' = T( 1, l,c') such that g = $l$ 2 g'. So it is sufficient to 
show that any element g' of the form g' = T(l, l,c'), leaving B invariant, coincide with the 
identity map. 

Obviously, g' acts on Li trivially (up to a scalar). Note also that the tensor projections of 
all elements of B to Li are pairwise distinct. It follows that g' fixes each element of B. So the 
map x 1 —^ xc~ x preserves any subspace of the form tpr La u, where v G B. Hence easily follows 
(by an argument similar to the proof of Lemma 5.11) that c is a scalar, whence g' — 1. □ 

Thus, the statement of Theorem 1.1, concerning the Laderman algorithm, is established. 

5.5 Some details of calculations 

In the arguments of Subsections 5.2-5.4 we used the transformations $ 1 , <h 2 , $ 3 , $4 “in 
ready form”, but the reader may ask (and the author should explain) how these transfor¬ 
mations were found. (Though, it is not difficult to have an idea of this from arguments of 
Subsection 5.4). 

The key idea is to decompose each of the spaces L, = M 3 3 (i = 1, 2, 3) as £* = < 8 ) V), 

where U % (resp., Vf) are three copies of the space of 3-columns (resp., 3-rows). Then L = 
L\® L 2 ® £3 decomposes as 


L = Ui <g> Hi <g> U 2 <g> V 2 ® U 3 <g> V 3 . (10) 

It follows from Theorem 4.12 that any element of F(t) is a decomposable automorphism of 
L with respect to decomposition (10). Next, a decomposable tensor u = u\® u 2 ® u 3 E L 
is decomposable with respect to (10) if and only if it is of type (1,1,1). It was noted above 
that Aut(£) must preserve the set B of all elements of £ of type (1,1,1). 
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We can consider B as a set of 13 tensors decomposable with respect to (10). It is 
convenient to write B as a table (see Table 1). 


Figure 1: 
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For example, the 3-rd row of the table reads as follows: 

h = (-ei + e 3 ) ® e 1 ® (ei - e 2 ) <S> e 3 <g) (e 3 + e 3 ) <g) e 3 = (-e n + e 31 ) ® (e i3 - e 23 ) <8> (e i3 + e 33 ). 

Assume that (7 G Aut(£) preserves each of the factors L l5 L 2 , and L 3 . Then g has the 
form g = T(a,b,c), in particular g preserves all factors Ui, V) of decomposition (10). Since 
g preserves B , it follows that the transformation x 1 —>• ax preserves the multiset 


fp% 1 ^ = {!■ (ei),4- (e 2 ),4 • (e 3 ),2- (ei -e 2 ),2- (ei - e 3 )}, 

whence a — 1 or a — 7r 23 , up to a scalar. Similarly one can show that b = 7t^ 3 and c = 1 t° 3 , 
for some 77 ,6 G Z 2 = {0,1}. Therefore 

9 = T (^23^13^23) 

for some e, 77 , 9 G Z 2 ■ It is easy to check that any element of the latter form with e + 0 + r) = 0 
preserves B, while the element with (e, r],6) = (0,0,1) (and therefore any element with 
e + 9 + 77 = 1) does not. 

Then the author has checked that elements with e + 9 + 77 = 0 preserve the set C\B 
(i.e., the set of all elements of C whose type is different from ( 1 , 1 , 1 )) also, and so preserve 
the whole £. Thus, the group Aut(£) D T°(f) turns out to be completely described. 

Next we should consider elements of Aut(£) corresponding to nontrivial permutation of 
the factors L 1 , L 2 , and L 3 . 

The observation that C contains 13 and 4 elements of types (1,1,1) and (2, 2, 2), respec¬ 
tively, and 2 elements of each of the types ( 2 , 1 , 1 ), ( 1 , 2 , 1 ), and ( 1 , 1 , 2 ), suggests that the 
group of permutations of factors induced by Aut(£) is either Z 3 or S :i . 

Indeed, using Lemma 5.9 and the table above, it is not hard to find candidates for 
automorphisms corresponding to nontrivial permutations of L\, L 2 , L 3 . A candidate for an 
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automorphism corresponding to (12)(3) can be found from the table very easily. In fact, this 
candidate is nothing else but $ 3 . As to an automorphism corresponding to (123), to find it 
is a more complicated task. 

Finally note that the checking that £ is invariant under all was made by the author 
directly, without using Lemma 5.5 and relations of Proposition 5.2. (These relations were 
found later, in order to streamline the argument in the present text.) 


6 Automorphisms of the Hopcroft algorithm 


6.1 Hopcroft algorithm 

Recall the description of the Hopcroft algorithm (in computational form), according to [24], 
Let 

(X^^ Xio\ / 

2/11 2/12 2/13 
2/21 2/22 2/23 



(Xu 

X12 \ 

X = 

X21 

2:22 


\x 31 

X32J 

Zij of the matrix 


z 11 

212 

Z 13 

221 

Z22 

Z23 

231 

Z 3 2 

Z33 


can be computed by formulae 

Zll = £l + £2 , z 22 = £3 + £4 , z 33 =p 5 +Pa, 


z 12 = ~P 2 - P3 + P7 - Ps , 221 = ~P 1 ~ £4 + £8 ~ £9 , 

213 = ~Pl ~ Pb ~ £13 + PlS , 2 3 i = -p 2 - £6 + £13 ~ Pl4 , 

^23 = -p:i ~P6+ Pll - Pl 2 , ^32 = Pa - Ps + Pw - Pll , 

where 

Pi = (am £12)2/11 > £2=2:12(2/11+2/21), £3 = 2:212/12, 

£4 = 2:222/22, £5=2:31(2/13 + 2/23), £e = (-2:31 + 2:32)2/23 , 

£7 = (2:11 + 2:21X2/11 + 2/12 + 2/21 + 2/22), 

£s = (2:11 - 2:12 + 2:21)(' 2 /n + 2/21 + 2/22), 

£9 = (2:11 - 2:12 + x 2 i - 2:22)(2/21 + 2/22), 

£10 = (2:22 + 2:32X2/12 + 2/13 + 2/22 + 2/23), 

£11 = (2:22 - 2:31 + 2:32) (2/12 + 2/13 + 2/23), 

£12 = ( — 2:21 + 2:22 — 2:31 + 2:32x2/12 + 2/13), 

£13 = (2:12 + X 31 )(y n - y 23 ), pu = (-2:12 - x 32 )(y 2 i + 2/23), 
£15 = (2:11 + x 31 )(y u + y 13 ). 
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Further, present Hopcroft algorithm in tensor form. Consider the following decomposable 
tensors in the space ild 32 <g> M 23 <S> M 33 : 

t\ = (cn — ei 2 ) < 8 ) eii ® (en — e 3 i — ei 2 ), 

^2 = e 32 < 8 ) (en + e 2 i) < 8 ) (en — e 23 — ei 3 ), 

0 — e 2i ® e 32 <g) (—e 2 i + e 22 — e 32 ), 

0 = e 22 <g) e 22 < 8 ) (—e 32 + e 22 — e 23 ), 
tb — e 3 i ® (ei3 + e 23 ) < 8 ) (e 33 — e 3 i — e 23 ), 

^6 — ( —e 3i + e 32 ) ® ?23 ® (e 33 — ei 3 — e 32 ), 
h — (en + e 2 i) ® (en + e 32 + e 2 i + e 22 ) ® e 2 i, 

ts — (oi — ei 2 + e 2 i) ® (en + e 23 + e 22 ) ® (e 32 — e 2 i), 

0 — ( e n — e i2 + e 2 i — e 22 ) < 8 ) (e 2 i + e 22 ) < 8 ) (—ei 2 ), 

Ho = (^22 + e 32 ) <S> (e 32 + ei 3 + e 22 + e 23 ) e 23 , 

hi = (e22 — e 3 i + e 32 ) < 8 > (e 32 + ei 3 + e 23 ) <E> (—e 23 + e 32 ), 
tn = ( — C2i + e 22 — e 3 i + e 32 ) <g> (e 32 + e 33 ) < 8 > (—e3 2 ), 
f 13 — ( e i2 + e 3 i) <£) (en — e 23 ) < 8 > (ei 3 — e 3 i), 
tu = (02 + e 3 2) <£) (e 2 i + e 23 ) ei 3 , 
hs — ( e n + e 3 i) ® (en + ei 3 ) <E> e 3 i. 

Proposition 6.1 The set TL = {ti ,..., } is the tensor form of the Hopcroft algorithm. 

Proof. A direct computation, left to the reader. □ 


6.2 The group Aut('H) 

In this subsection we find the automorphism group of the Hopcroft algorithm. Our arguments 
are similar to those for Laderman’s algorithm, cf. Subsections 5.2-5.4. So we provide only 
the results of computations,usually, leaving the details to the reader. 

Let Li = M 32 , L -2 = M 23 , L 3 = M 33 , and L — Li ® L 2 ® L 3 . We can consider transfor¬ 
mations of L of the form T(a, b, c), where a, c € GL 3 (K) and b G GL 2 (K ). 

Put d = ^ G GL 2 (K), then d 3 = 1 and d -1 = ^ j ^ . Also put 

/0 0 1 
7Ti2 3 =10 0 

\0 1 0 

Let Cij be a matrix unit in one of the spaces M 32 , M 23 , or M 33 . Observe that the multipli¬ 
cation of the matrix by 7 Ti 23 on the left, resp. by 7^3 on the right, shifts the subscript 
i (resp., j ) by 1: 7r 123 e,j = e i+ ij (if the product ni 23 ejj makes sense, that is, if e l3 G M 32 or 
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M 33 ), and similarly e lJ r:ff 3 = G,j+ 1 , if € M 23 or M 33 . Here subscripts i + 1 (j + 1) are 
taken modulo 3, so that 3 + 1 = 1. 

Consider transformations 


4>1 = T(7Ti23, d, VTi 2 3 ) 


x <g) y <g) £ h+ 7Ti23a: 




1 \ _! _! 
x ) 2 / 7+23 ® 7 + 23 - 7+23 , 


$2 = T(7Ti 3 , 7Tl2, 7 Ti 3 ) \ X ® y ® Z ® X 12 y7l 13 <g> 7l 13 Z7T 13 , 


and 


$3 2/Vi2£i <8> £1^12^ (8) z* 

(here notation 7 t 12 , 7t 13 , and £i have the same meaning as in Subsection 5.2). 

Lemma 6.2 The transformations $ 1 , $ 2 , and <h 3 satisfy the following relations: 

$3 = $2 = $2 = 1? = $-l ; 


— < 1 > 1 < 1 > 3 , < f > 3 < f > 2 — < 1 > 2 < 1 > 3 . 

Proof. Recall that the map by the rule ( a,b,c ) 1 —> T(a,b,c ) is a group homomorphism 
(from GL 3 {K ) x GL 2 (K ) x GL 3 {K ) to GL(L)). So the relations = $ 2 = 1 follow from 
7r i 23 = ^3 = 1 and d 3 = 7 if 2 = 1. Similarly, the relation < h 2 < hi < l > 2 = 'f’b 1 follows from 
7+371-12371-13 = 71(33 anc l 7112^7112 = d^ 1 . The remaining three relations can be proved by direct 
computations (cf. the proof of Proposition 5.2). □ 


Lemma 6.3 Let G be a group generated by three elements a 3 , a 2 , a 3 . Suppose that ai satisfy 
relations 

o 3 = Q -2 = ®3 = 1 ) a 2 a\a 2 = ci i o.yi i = O 1 O 3 , a 3 a 2 — a 2 a 3 . (ff) 

Suppose also that aq 7 ^ 1 and a 3 7 ^ 1. Then G = S 3 x Z 2 . 

Proof. The argument is similar to that in the proof of Lemma 5.3. Note first that system (11) 
is equivalent to 

o ' 3 = a 2 = O 3 = 1 , a 2 a\ = a^a 2 , a 3 a\ = Q 1 O 3 , a 3 a 2 = a 2 a 3 . ( 12 ) 

Next, consider the group S 3 x Z 2 . The elements of Z 2 will be denoted by 0 and 1. In 
S 3 x Z 2 consider the elements 61 = ((123), 0), b 2 = ((12), 0), and b 3 = (e, 1). Clearly, they 
satisfy relations ( 11 ) and ( 12 ). 

Further, observe that if X is any group generated by three elements c t satisfying rela¬ 
tions (12) (with a,; replaced by cf), then any element of X can be written as c l fc 2 c 3 , where 
0 < h < 2, 0 < l 2 , l 3 < 1. Hence |X| < 12 = |S '3 x Z 2 \. 

Finally, it is easy to see that any minimal normal subgroup of S 3 x Z 2 is either (bf ) 3 or 

(h) 2 - 

Using the facts observed it is not hard to give a proof similar to that of Lemma 5.3. The 
details are left to the reader. □ 


The following statement is an obvious corollary of Lemmas 6.2 and 6.3. 
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Corollary 6.4 The group G = (4>i, $ 2 , $ 3 ) is isomorphic to S 3 x Z- 2 . 

Lemma 6.5 The following relations for action of transformations 4>j on tensors tj hold: 

$1 : t\ 1 y £4 1 ^ £5 , t 2 1—>• £3 1—> , £7 H> t\2 H> t 14 , 

£9 l— >• £10 ^ £15 , 1-8 H > tn l— 1 1\3 ; 

4*2 : £1 1 y tg , ti4 1 ^ ti5 , £ 8 ^ £111 

$3 : £1 H > £2 , £7 l— > £9 j ts tg . 

Proof. A direct computation. □ 

Proposition 6.6 Consider the action of the group G = (4>i, 4> 2 , 4> 3 ) on the set of nonzero 
decomposable tensors in L. Then the sets 

Hi = {£ 1 ,..., £ 6 } = {1, 2, 3,4, 5, 6 }, 

fl 2 = {7,9,10,12,14,15}, 

and 

n 3 = {8,11,13} 

are G-orbits. hi particular, the set 

TL = Hi U 1^2 U H3 = {£1,..., £15} 


is invariant under G. 

Proof. The argument is similar to the proof of Proposition 5 . 7 . First of all, it follows from 
Lemma 6.5 and the relation = 1 that the sets 

k-h = {£ 1 , £ 4 , £ 5 } = {1,4, 5}, uj 2 = {2,3,6}, U 3 — {7,12,14}, 

0 J 4 = {8,11,13}, and CJ 5 = {9,10,15} 

are orbits of the group (4*1)3. Next, as 4 > 2 normalizes ( 4 > i) 3 , it follows from Lemma 5.5 and 
the relation 

4*2 : £1 £ 6 , t 14 1—>• £ 15 , £ 8 1 —> tu , 

that 4*2 interchanges oj\ and u; 2 , c 03 and u> 5 , and preserves ta 4 . So the sets Hi = U ta 2 , 
112 = ojz U 0J5, and H3 = 0J4, are orbits under the group ( 4 >i, 4 > 2 ). Finally, as 4> 3 normalizes 
(even centralizes) ( 4 >i, 4*2), it follows from the action of 4>3 on £1, £7, and £ 8 , that each of the 
( 4 >i, 4>2)-orbits Hi, H2, and H3 is invariant under 4>3. So H* is an orbit under G. □ 


Corollary 6.7 G = (4>i, 4> 2 , 4> 3 ) is a subgroup of Aut('H). 

In the rest of this section we prove that, in fact, Aut("H) = G. 

Let Aut('H)o < Aut('H) be the subgroup of all elements preserving each of the factors 
L 1 , L 2 , and L 3 . The following lemma and its proof are similar to Lemma 5.8. 


Lemma 6.8 We have Aut("H) = Aut('H) 0 (4 > 3 ) 2 . 
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Proof. Let n : F(t) —» S 3 be the homomorphism that assigns to each element g G T(t) 
the corresponding permutation of the factors L 1 , L 2 , and L 3 . As dim L\ = dim L 2 = 6 
and dimZ /3 = 9, we have 7 r(T(t)) C {e, (12)(3)} (and, actually, 7 r(T(t)) = {e, (12)(3)} by 
Theorem 4.12). On the other hand, it is clear that ^($ 3 ) = (12)(3). The rest of the proof is 
similar to that of Lemma 5.8. □ 

Let V and V' be spaces of 3-columns and 3-rows, respectively, (ei, 62 , 63 ) and (e^e^e 3 ) 
be the usual bases of V and V', and let GL 3 (K ) acts on V and V' in the usual way. Let 
c 6 S 3 be a permutation. By o we denote the element of GL 3 (K) permuting basis vectors 
(ej) according to a. 

Lemma 6.9 Let p G GL 3 (K ) be an element such that the sets of three lines 

A” = {(ei + ef), (ei + 63 ), (e 2 + 63 )} 


and 

y = {(e 1 )Ae 2 )Ae 3 )} 

in V and V' are invariant under p. Then p = Xd for some o G S 3 and A G K*. 

Proof. Note that for any cr G S 3 the transformation a preserves both X and y. Moreover, 
the permutation by which 0 acts on y coincides with a. Now let p be as in the hypothesis 
of the lemma, and o be the permutation by which p acts on y. Then the transformation 
p' = o~ 1 p leaves each of the lines (e*) invariant. Therefore p' = diag(Ai, A 2 , A 3 ) for some 
A i G K*. Also, p' preserves X. So the line p'({e 1 + 62 )) = (Aiei + ^ 2 ) must be in 
A, and therefore it coincides with (ei + ef)^ whence Ai = A 2 . Similarly Ai = A 3 , whence 
p' = diag(A 1 , Ai, Ai) = Ai E. So p = ap' = Aid. □ 

Proposition 6.10 The equality Aut("H)o = ($ 1 , $ 2 ) holds. 

Proof. The argument is mainly similar to the proof of Proposition 5.12. Any element of 
Aut("H)o is of the form T(a,b,c), for some ( a,b,c ) G GL 3 (K) x GL 2 (K) x GL 3 (K). 

For any m, n G N (not necessary m = n) and any x G GL m (K), y G GL n (K), z G M mn 
we have rk (xz) = rk (zy) = rk(^). So for any decomposable tensor u = u\ ® u 2 <E) u 3 G 
M mn <g) M np <g) M pm and any element g = T(a,b,c), where a G GL m (K), b G GL n (K ), 
c G GL P (K), the tensors u and g(u) are of the same type. Therefore Aut("H)o preserves the 
subset of all tensors of type (1,1,1) in TL. It is easy to see that this subset is 

Q 2 = {ti\i = 7,9,10,12,14,15}. 

Thus, it is sufficient to prove the following statement: 

(*) If a transformation g = T(a,b,c), where a, c G GL 3 (K) and b G GL 2 (K), 
leaves the set H 2 invariant, then g G ($ 1 , 4 > 2). 

Let D and F (resp., D' and F') be two copies of the space of 3-columns (resp., 3-rows), 
and let E and E' be spaces of 2-columns and 2-rows, respectively. Consider tensor product 


N = D®E'®E®F'®F®D’. 
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We can identify N with L = M 32 ® M 23 <S> M 33 by the isomorphism r : N —» L defined by 

r(d ® e' ® e ® f ® f ® d!) = de' ® ef' ® fd! 


(cf. Subsection 4.2). 

Let B and g' be the subset and the transformation of N, corresponding to 1^2 and g, 
respectively, with respect to the isomorphism r. That is, B = T -1 (fi 2 ) and g' = r" 1 gr. 
Then g'{B ) = (t " 1 gfij " 1 {ft 2 )) = T~ 1 {g{ft 2 )) = T _1 (fl 2 ) = B , that is, g' preserves B. 

It is easy to write B and g' explicitly. Namely, 

B = { (ei + e 2 ) <S> e 1 ® (ei + e 2 ) ® (e 1 + e 2 ) ® e 2 ® e 1 , 

(ei + e 2 ) <8> (-e 1 + e 2 ) ® e 2 ® (e 1 + e 2 ) ® e± ® e 2 , 

(e 2 + e 3 ) ®e 2 ® (ei + e 2 ) 0 (e 2 + e 3 ) ® e 2 ® e 3 , 

(e 2 + e 3 ) ® (e 1 - e 2 ) ® ei ® (e 2 + e 3 ) <g) e 3 ® e 2 , 

(ei + e 3 ) ® e 2 ® e 2 ® (e 1 + e 3 ) ® ei ® e 3 , 

(ei + e 3 ) ® e 1 ® ei ® (e 1 + e 3 ) ® e 3 ® e 1 }. 

Also it is easy to see that g' acts according to the formula 

g\d ® e' ® e ® f ® / ® d') = ad® e'b" x ®be® f'c " 1 ® cf ® d'a -1 . (13) 

Indeed, we have 

g(r(d ® e' ® e ® f ® f ® d')) = g{de' ® ef ® fd ') 

= T(a, &, c){de' ® ef ® fd') = ade'b " 1 ® be f'c " 1 ® cfd'a" 1 . 

But the latter expression coincides with 

r(ad ® eb " 1 ®be® f c " 1 ® cf ® d'a" 1 ). 


Therefore, 


g\d® e' ® e® f ® f ® d') = (r ^r) (d® e' ® e ® f ® f ® d') 

= r _1 (ade'6 _1 <S> be f'c" 1 ® cfd'a _1 ) = ad® e'b" 1 ® be ® f'c" 1 ® cf ® d'a -1 , 
which proves formula (13). 

Equality g'(£>) = £>, together with formula (13), imply that the tensor projection tpr^L) 
is invariant under the transformation d 1 —>■ ad (d G D). It is immediately seen that 


tpr d B = {(ei + e 2 ), (ei + e 3 ), (e 2 + e 3 )}. 


Therefore the transformation d 1 —>■ ad preserves the latter set. Similarly, 

tpv D ,B = {(e 1 ), (e 2 ), (e 3 )} 

must be invariant under transformation d! > d'a" 1 . So the element a G GLfK) satisfies 
hypothesys of Lemma 6.9, whence a = Xd for some A G K* and a permutation a G S3. 
Thus, g = T(\d, b, c ) = T{d, b, c ). 
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Remembering that $1 = T( 7 ri 23 , -I , 7 Ti 2 3 ) and <P 2 = T(tt 13 ,^ 12 ,^ 13 ), and taking 

into account that ( 123 ) and ( 13 )( 2 ) generate S 3 , we see that there exists an element gi G 
($ 1 , <P 2 ) of the form g\ = T{a,b\,ci). Therefore the element g 2 = gf l g is of the form 
T(1,6 2 ,c 2 ). Moreover, it is clear that g 2 preserves fl 2 . So it is sufficient to show that if an 
element g 2 of the form T(l, 6 2 ,c 2 ) preserves fl 2 , then g 2 G ( < hi, < h 2 ). We shall prove even 
more, namely that g 2 = 1 . 

Let g 2 = r~ 1 g 2 r be the transformation of N, corresponding to g 2 . Clearly, g 2 preserves B. 
Further, formula ( 13 ) immediately implies that for any element v G N the tensor projections 
of v and g 2 (v) to D ® D' coincide: 

tp *d®d' 92 ( v ) = tP *D®iy v > Vt: G N. 

But the tensor projections of all elements of B to D <S> D' are pairwise distinct, whence 
g 2 iy) = v. That is, g' 2 hxes each element of B. It follows that the transformation / 1 —>■ c 2 f 
preserves tensor projection to F of each element of B, so preserves each of three lines (ei), 
(e 2 ), and (e 3 ). Similarly, the transformation f 1 —> f'c. 7 1 preserves F'-projection of each 
element of B, that is, preserves each of the three lines (e 1 + e 2 ), (e 1 + e 3 ), and (e 2 + e 3 ). 
Hence easily follows that c 2 is a scalar. 

Finally, the transformation e 1 —> b 2 e preserves tensor projection to E of each element of 
B, and therefore preserves each of the lines (ei), (e 2 ), and (ei + e 2 ). Hence b 2 is a scalar also. 
As both b 2 and c 2 are scalars, we obtain that g 2 = T( 1 , b 2 , c 2 ) = 1 (= icR). □ 

We summarize the results of the present section in the following proposition. 

Proposition 6.11 The group Aut (Ti) coincides with G = ($ 1 , <h 2 , $ 3 ). The latter group is 
isomorphic to S 3 x Z 2 . 

The latter proposition proves the part of Theorem 1.1 concerning the Hopcroft algorithm. 
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